COMPOSITIONS AND METHODS FOR THE TREATMENT 
AND DIAGNOSIS OF BREAST CANCER 

TECHNICAL FIELD 

The present invention relates generally to the detection and therapy of 
breast cancer. The invention is more specifically related to nucleotide sequences that 
are preferentially expressed in breast tumor tissue and to polypeptides encoded by such 
nucleotide sequences. The nucleotide sequences and polypeptides may be used in 
vaccines and pharmaceutical compositions for the prevention and treatment of breast 
cancer. The polypeptides may also be used for the production of compounds, such as 
antibodies, useful for diagnosing and monitoring the progression of breast cancer in a 
patient. 

BACKGROUND OF THE INVENTION 

Breast cancer is a significant health problem for women in the United 
States and throughout the world. Although advances have been made in detection and 
treatment of the disease, breast cancer remains the second leading cause of cancer- 
related deaths in women, affecting more than 180,000 women in the United States each 
year. For women in North America, the lifetime odds of getting breast cancer are now 
one in eight. 

No vaccine or other universally successful method for the prevention or 
treatment of breast cancer is currently available. Management of the disease currently 
relies on a combination of early diagnosis (through routine breast screening procedures) 
and aggressive treatment, which may include one or more of a variety of treatments 
such as surgery, radiotherapy, chemotherapy and hormone therapy. The course of 
treatment for a particular breast cancer is often selected based on a variety of prognostic 
parameters, including an analysis of specific tumor markers. See, e.g., Porter-Jordan 
and Lippman, Breast Cancer 5:73-100 (1994). However, the use of established markers 
often leads to a result that is difficult to interpret, and the high mortality observed in 
breast cancer patients indicates that improvements are needed in the treatment, 
diagnosis and prevention of the disease. 

Accordingly, there is a need in the art for improved methods for therapy 
and diagnosis of breast cancer. The present invention fulfills these needs and further 
provides other related advantages. 

SUMMARY OF THE INVENTION 

Briefly stated, the subject invention provides compositions and methods 
for the diagnosis and therapy of breast cancer. In one aspect, isolated DNA molecules 
are provided, comprising (a) a nucleotide sequence preferentially expressed in breast 
cancer tissue, relative to normal tissue; (b) a variant of such a sequence that contains 
one or more nucleotide substitutions, deletions, insertions and/or modifications at no 
more than 20% (preferably no more than 5%) of the nucleotide positions, such that the 
antigenic and/or immunogenic properties of the polypeptide encoded by the nucleotide 
sequence are retained; or (c) a nucleotide sequence encoding an epitope of a polypeptide 
encoded by at least one of the above sequences. In one embodiment, the isolated DNA 
molecule comprises a human endogenous retroviral sequence recited in SEQ ID NO:1. 
In other embodiments, the isolated DNA molecule comprises a nucleotide sequence 
recited in any one of SEQ ID NO: 3-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 
178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 
250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 and 
291-297. 

In related embodiments, the isolated DNA molecule encodes an epitope 
of a polypeptide, wherein the polypeptide is encoded by a nucleotide sequence that: (a) 
hybridizes to a sequence recited in any one of SEQ ID NO: 1, 3-26, 28-77, 142, 143, 
146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 
219, 221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 
278, 280, 281, 284, 288 and 291-297 under stringent conditions; and (b) is at least 80% 
identical to a sequence recited in any one of SEQ ID NO: 1, 3-26, 28-77, 142, 143, 146- 
152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 
221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 
280, 281, 284, 288 and 291-297; and wherein RNA corresponding to said nucleotide 
sequence is expressed at a greater level in human breast tumor tissue than in normal 
breast tissue. 

In another embodiment, the present invention provides an isolated DNA 
molecule encoding an epitope of a polypeptide, the polypeptide being encoded by: (a) a 
nucleotide sequence transcribed from the sequence of SEQ ID NO: 141; or (b) a variant 
of said nucleotide sequence that contains one or more nucleotide substitutions, 
deletions, insertions and/or modifications at no more than 20% of the nucleotide 
positions, such that the antigenic and/or immunogenic properties of the polypeptide 
encoded by the nucleotide sequence are retained. Isolated DNA and RNA molecules 
comprising a nucleotide sequence complementary to a DNA molecule as described 
above are also provided. 

In related aspects, the present invention provides recombinant expression 
vectors comprising a DNA molecule as described above and host cells transformed or 
transfected with such expression vectors. 

In further aspects, polypeptides, comprising an amino acid sequence 
encoded by a DNA molecule as described above, and monoclonal antibodies that bind 
to such polypeptides are provided. 

In yet another aspect, methods are provided for determining the presence 
of breast cancer in a patient. In one embodiment, the method comprises detecting, 
within a biological sample, a polypeptide as described above. In another embodiment, 
the method comprises detecting, within a biological sample, an RNA molecule 
encoding a polypeptide as described above. In yet another embodiment, the method 
comprises (a) intradermally injecting a patient with a polypeptide as described above; 
and (b) detecting an immune response on the patient's skin and therefrom detecting the 
presence of breast cancer in the patient. In further embodiments, the present invention 
provides methods for determining the presence of breast cancer in a patient as described 
above wherein the polypeptide is encoded by a nucleotide sequence selected from the 
group consisting of SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 
215, 217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 
285-287, 289, 290 and sequences that hybridize thereto under stringent conditions. 

In a related aspect, diagnostic kits useful in the determination of breast 
cancer are provided. The diagnostic kits generally comprise either one or more 
monoclonal antibodies as described above, or one or more monoclonal antibodies that 
bind to a polypeptide encoded by a nucleotide sequence selected from the group 
consisting of sequences provided in SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 
199, 205, 208, 215, 217, 220, 241, 242 and 246, 248, 249, 252, 256, 267, 270, 274, 277, 
279, 282, 283, 285-287, 289, 290 and a detection reagent. 

Within a related aspect, the diagnostic kit comprises a first polymerase 
chain reaction primer and a second polymerase chain reaction primer, at least one of the 
primers being specific for an RNA molecule described herein. In one embodiment, at 
least one of the primers comprises at least about 10 contiguous nucleotides of an RNA 
molecule as described above, or an RNA molecule encoding a polypeptide encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86, 144, 
145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 
256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289 and 290. 

Within another related aspect, the diagnostic kit comprises at least one 
oligonucleotide probe, the probe being specific for a DNA molecule described herein. 
In one embodiment, the probe comprises at least about 15 contiguous nucleotides of a 
DNA molecule as described above, or a DNA molecule selected from the group 
consisting of SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 
217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 
289 and 290. 
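
The contiguous-nucleotide criterion for primers (about 10) and probes (about 15) can be illustrated with a minimal Python sketch. The function name and the example sequences are hypothetical and are not taken from the sequence listing. The check looks only at the same strand; practical probe design would also consider the complementary strand and hybridization specificity.

```python
def shares_contiguous_run(candidate: str, target: str, k: int = 15) -> bool:
    """Return True if candidate and target share at least k identical contiguous nucleotides."""
    if k <= 0:
        raise ValueError("k must be positive")
    candidate = candidate.upper().replace(" ", "")
    target = target.upper().replace(" ", "")
    # Slide a window of length k along the candidate and look each window up in the target.
    for i in range(len(candidate) - k + 1):
        if candidate[i:i + k] in target:
            return True
    return False


# Hypothetical sequences, for illustration only.
target = "ATGGCTATTTTCGGGGGCTGACATTGCA"
probe = "TTTCGGGGGCTGACA"  # 15 nucleotides copied from the hypothetical target
print(shares_contiguous_run(probe, target, k=15))
print(shares_contiguous_run("ACGTACGTACGTACG", target, k=15))
```

Lowering k to 10 gives the corresponding check for a primer.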

In another related aspect, the present invention provides methods for 
monitoring the progression of breast cancer in a patient. In one embodiment, the 
method comprises: (a) detecting an amount, in a biological sample, of a polypeptide as 
described above at a first point in time; (b) repeating step (a) at a subsequent point in 
time; and (c) comparing the amounts of polypeptide detected in steps (a) and (b), and 
therefrom monitoring the progression of breast cancer in the patient. In another 
embodiment, the method comprises (a) detecting an amount, within a biological sample, 
of an RNA molecule encoding a polypeptide as described above at a first point in time; 
(b) repeating step (a) at a subsequent point in time; and (c) comparing the amounts of 
RNA molecules detected in steps (a) and (b), and therefrom monitoring the progression 
of breast cancer in the patient. In yet other embodiments, the present invention provides 
methods for monitoring the progression of breast cancer in a patient as described above 
wherein the polypeptide is encoded by a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 
217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 
289, 290 and sequences that hybridize thereto under stringent conditions. 

In still other aspects, pharmaceutical compositions, which comprise a 
polypeptide as described above in combination with a physiologically acceptable 
carrier, and vaccines, which comprise a polypeptide as described above in combination 
with an immune response enhancer or adjuvant, are provided. In yet other aspects, the 
present invention provides pharmaceutical compositions and vaccines comprising a 
polypeptide encoded by a nucleotide sequence selected from the group consisting of 
SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 
242 and 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 
and sequences that hybridize thereto under stringent conditions. 

In related aspects, the present invention provides methods for inhibiting 
the development of breast cancer in a patient, comprising administering to a patient a 
pharmaceutical composition or vaccine as described above. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description and attached drawings. All 
references disclosed herein are hereby incorporated by reference in their entirety as if 
each was incorporated individually. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the differential display PCR products, separated by gel 
electrophoresis, obtained from cDNA prepared from normal breast tissue (lanes 1 and 2) 
and from cDNA prepared from breast tumor tissue from the same patient (lanes 3 and 
4). The arrow indicates the band corresponding to B18Ag1. 

Figure 2 is a northern blot comparing the level of B18Ag1 mRNA in 
breast tumor tissue (lane 1) with the level in normal breast tissue. 

Figure 3 shows the level of B18Ag1 mRNA in breast tumor tissue 
compared to that in various normal and non-breast tumor tissues as determined by 
RNase protection assays. 

Figure 4 is a genomic clone map showing the location of additional 
retroviral sequences obtained from ends of XbaI restriction digests (provided in SEQ ID 
NO:3 - SEQ ID NO: 10) relative to B18Ag1. 

Figures 5A and 5B show the sequencing strategy, genomic organization 
and predicted open reading frame for the retroviral element containing B18Ag1. 

Figure 6 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B18Ag1. 

Figure 7 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B17Ag1. 

Figure 8 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B17Ag2. 

Figure 9 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B13Ag2a. 

Figure 10 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B13Ag1b. 

Figure 11 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B13Ag1a. 

Figure 12 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B11Ag1. 

Figure 13 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B3CA3c. 

Figure 14 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B9CG1. 

Figure 15 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B9CG3. 

Figure 16 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B2CA2. 

Figure 17 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B3CA1. 

Figure 18 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B3CA2. 

Figure 19 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B3CA3. 

Figure 20 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B4CA1. 

Figure 21A depicts RT-PCR analysis of breast tumor genes in breast 
tumor tissues (lanes 1-8) and normal breast tissues (lanes 9-13) and H2O (lane 14). 

Figure 21B depicts RT-PCR analysis of breast tumor genes in prostate 
tumors (lanes 1, 2), colon tumors (lane 3), lung tumor (lane 4), normal prostate (lane 5), 
normal colon (lane 6), normal kidney (lane 7), normal liver (lane 8), normal lung (lane 
9), normal ovary (lanes 10, 18), normal pancreas (lanes 11, 12), normal skeletal 
muscle (lane 13), normal skin (lane 14), normal stomach (lane 15), normal testes (lane 
16), normal small intestine (lane 17), HBL-100 (lane 19), MCF-12A (lane 20), breast 
tumors (lanes 21-23), H2O (lane 24), and colon tumor (lane 25). 

DETAILED DESCRIPTION OF THE INVENTION 

As noted above, the present invention is generally directed to 
compositions and methods for the diagnosis, monitoring and therapy of breast cancer. 
The compositions described herein include polypeptides, nucleic acid sequences and 
antibodies. Polypeptides of the present invention generally comprise at least a portion 
of a protein that is expressed at a greater level in human breast tumor tissue than in 
normal breast tissue (i.e., the level of RNA encoding the polypeptide is at least 2-fold 
higher in tumor tissue). Such polypeptides are referred to herein as breast tumor- 
specific polypeptides, and cDNA molecules encoding such polypeptides are referred to 
as breast tumor-specific cDNAs. Nucleic acid sequences of the subject invention 
generally comprise a DNA or RNA sequence that encodes all or a portion of a 
polypeptide as described above, or that is complementary to such a sequence. 
Antibodies are generally immune system proteins, or fragments thereof, that are capable 
of binding to a portion of a polypeptide as described above. Antibodies can be 
produced by cell culture techniques, including the generation of monoclonal antibodies 
as described herein, or via transfection of antibody genes into suitable bacterial or 
mammalian cell hosts, in order to allow for the production of recombinant antibodies. 
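
The 2-fold expression criterion stated above can be written as a simple ratio test. The following Python sketch is illustrative only: the function names are hypothetical, and it assumes that tumor and normal RNA levels have already been measured in the same units and normalized in the same way.

```python
def fold_change(tumor_level: float, normal_level: float) -> float:
    """Ratio of the RNA level in tumor tissue to the level in normal tissue."""
    if tumor_level < 0 or normal_level <= 0:
        raise ValueError("levels must be non-negative and the normal level must be positive")
    return tumor_level / normal_level


def meets_expression_criterion(tumor_level: float, normal_level: float,
                               min_fold: float = 2.0) -> bool:
    """True if the tumor level is at least min_fold times the normal level."""
    return fold_change(tumor_level, normal_level) >= min_fold


# Hypothetical measurements in arbitrary units.
print(meets_expression_criterion(10.0, 4.0))  # ratio 2.5
print(meets_expression_criterion(6.0, 4.0))   # ratio 1.5
```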

Polypeptides within the scope of this invention include, but are not 
limited to, polypeptides (and epitopes thereof) encoded by a human endogenous 
retroviral sequence, such as the sequence designated B18Ag1 (Figure 5 and SEQ ID 
NO:1). Also within the scope of the present invention are polypeptides encoded by 
other sequences within the retroviral genome containing B18Ag1 (SEQ ID NO: 141). 
Such sequences include, but are not limited to, the sequences recited in SEQ ID NO:3 - 
SEQ ID NO: 10. B18Ag1 has homology to the gag p30 gene of the endogenous human 
retroviral element S71, as described in Werner et al., Virology 174:225-238 (1990) and 
also shows homology to about thirty other retroviral gag genes. As discussed in more 
detail below, the present invention also includes a number of additional breast tumor- 
specific polypeptides, such as those encoded by the nucleotide sequences recited in 
SEQ ID NO: 11-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 
200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 250, 251, 253, 255, 
257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 and 291-297. As used 
herein, the term "polypeptide" encompasses amino acid chains of any length, including 
full length proteins containing the sequences recited herein. A polypeptide comprising 
an epitope of a protein containing a sequence as described herein may consist entirely of 
the epitope, or may contain additional sequences. The additional sequences may be 
derived from the native protein or may be heterologous, and such sequences may (but 
need not) possess immunogenic or antigenic properties. 

An "epitope," as used herein, is a portion of a polypeptide that is 
recognized (i.e., specifically bound) by a B-cell and/or T-cell surface antigen receptor. 
Epitopes may generally be identified using well known techniques, such as those 
summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 1993) 
and references cited therein. Such techniques include screening polypeptides derived 
from the native polypeptide for the ability to react with antigen-specific antisera and/or 
T-cell lines or clones. An epitope of a polypeptide is a portion that reacts with such 
antisera and/or T-cells at a level that is similar to the reactivity of the full length 
polypeptide (e.g., in an ELISA and/or T-cell reactivity assay). Such screens may 
generally be performed using methods well known to those of ordinary skill in the art, 
such as those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold 
Spring Harbor Laboratory, 1988. B-cell and T-cell epitopes may also be predicted via 
computer analysis. Polypeptides comprising an epitope of a polypeptide that is 
preferentially expressed in a tumor tissue (with or without additional amino acid 
sequence) are within the scope of the present invention. 

The compositions and methods of the present invention also encompass 
variants of the above polypeptides and nucleic acid sequences encoding such 
polypeptides. A polypeptide "variant," as used herein, is a polypeptide that differs from 
the native polypeptide in substitutions and/or modifications, such that the antigenic 
and/or immunogenic properties of the polypeptide are retained. Such variants may 
generally be identified by modifying one of the above polypeptide sequences and 
evaluating the reactivity of the modified polypeptide with antisera and/or T-cells as 
described above. Nucleic acid variants may contain one or more substitutions, 
deletions, insertions and/or modifications such that the antigenic and/or immunogenic 
properties of the encoded polypeptide are retained. One preferred variant of the 
polypeptides described herein is a variant that contains nucleotide substitutions, 
deletions, insertions and/or modifications at no more than 20% of the nucleotide 
positions. 
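
The 20% limit on altered nucleotide positions can be checked with a short calculation. The Python sketch below is a simplified illustration and rests on several assumptions not stated in the text: the two sequences are already aligned to equal length, gaps are written as "-", each differing column counts as one altered position, and the denominator is the alignment length. The function names and example sequences are hypothetical.

```python
def fraction_changed(reference: str, variant: str) -> float:
    """Fraction of aligned positions at which the variant differs from the reference."""
    if len(reference) != len(variant):
        raise ValueError("sequences must be pre-aligned to the same length")
    if not reference:
        raise ValueError("sequences must not be empty")
    # Substitutions, deletions and insertions all appear as mismatching columns.
    differences = sum(1 for r, v in zip(reference.upper(), variant.upper()) if r != v)
    return differences / len(reference)


def within_variant_limit(reference: str, variant: str, max_fraction: float = 0.20) -> bool:
    """True if no more than max_fraction of the aligned positions differ."""
    return fraction_changed(reference, variant) <= max_fraction


# Hypothetical 10-nucleotide sequences.
print(within_variant_limit("ATGGCTATTT", "ATGGCTATTA"))  # 1 of 10 positions differ
print(within_variant_limit("ATGGCTATTT", "TTGACTCTTA"))  # 4 of 10 positions differ
```

Passing max_fraction=0.05 gives the narrower 5% limit mentioned in the summary.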

WO 98/45328 PCT/US98/06939 

Preferably, a variant contains conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 
amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary structure and hydropathic nature of the 
polypeptide to be substantially unchanged. In general, the following groups of amino 
acids represent conservative changes: (1) ala, pro, gly, glu, asp, gln, asn, ser, thr; 
(2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, 
his. 
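
The five groups listed above translate directly into a lookup. The following Python sketch is illustrative; the names CONSERVATIVE_GROUPS and is_conservative are hypothetical. It treats a substitution as conservative when both residues appear together in at least one of the listed groups.

```python
# The five groups of amino acids listed in the text (three-letter codes).
CONSERVATIVE_GROUPS = [
    {"ala", "pro", "gly", "glu", "asp", "gln", "asn", "ser", "thr"},
    {"cys", "ser", "tyr", "thr"},
    {"val", "ile", "leu", "met", "ala", "phe"},
    {"lys", "arg", "his"},
    {"phe", "tyr", "trp", "his"},
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if both residues appear together in at least one listed group."""
    a, b = original.lower(), replacement.lower()
    if a == b:
        return False  # identical residues are not a substitution
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


print(is_conservative("leu", "ile"))  # both in group (3)
print(is_conservative("lys", "glu"))  # no shared group
```

Because some residues (for example ser, thr, ala, phe, tyr and his) appear in more than one group, membership is tested group by group rather than by assigning each residue a single class.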

Variants may also (or alternatively) be modified by, for example, the 
deletion or addition of amino acids that have minimal influence on the immunogenic or 
antigenic properties, secondary structure and hydropathic nature of the polypeptide. For 
example, a polypeptide may be conjugated to a signal (or leader) sequence at the N- 
terminal end of the protein which co-translationally or post-translationally directs 
transfer of the protein. The polypeptide may also be conjugated to a linker or other 
sequence for ease of synthesis, purification or identification of the polypeptide (e.g., 
poly-His), or to enhance binding of the polypeptide to a solid support. For example, a 
polypeptide may be conjugated to an immunoglobulin Fc region. 

In general, nucleotide sequences encoding all or a portion of the 
polypeptides described herein may be prepared using any of several techniques. For 
example, cDNA molecules encoding such polypeptides may be cloned on the basis of 
the breast tumor-specific expression of the corresponding mRNAs, using differential 
display PCR. This technique compares the amplified products from RNA template 
prepared from normal and breast tumor tissue. cDNA may be prepared by reverse 
transcription of RNA using a (dT)12AG primer. Following amplification of the cDNA 
using a random primer, a band corresponding to an amplified product specific to the 
tumor RNA may be cut out from a silver stained gel and subcloned into a suitable 
vector (e.g., the T-vector, Novagen, Madison, WI). Nucleotide sequences encoding all 
or a portion of the breast tumor-specific polypeptides disclosed herein may be amplified 
from cDNA prepared as described above using the random primers shown in SEQ ID 
NO.:87-125. 

Alternatively, a gene encoding a polypeptide as described herein (or a 
portion thereof) may be amplified from human genomic DNA, or from breast tumor 
cDNA, via polymerase chain reaction. For this approach, B18Ag1 sequence-specific 
primers may be designed based on the sequence provided in SEQ ID NO:1, and may be 
purchased or synthesized. One suitable primer pair for amplification from breast tumor 
cDNA is (5'ATG GCT ATT TTC GGG GGC TGA CA) (SEQ ID NO.:126) and 
(5'CCG GTA TCT CCT CGT GGG TAT T) (SEQ ID NO.:127). An amplified portion 
of B18Ag1 may then be used to isolate the full length gene from a human genomic 
DNA library or from a breast tumor cDNA library, using well known techniques, such 
as those described in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratories, Cold Spring Harbor, NY (1989). Other sequences within 
the retroviral genome of which B18Ag1 is a part may be similarly prepared by 
screening human genomic libraries using B18Ag1-specific sequences as probes. 
Nucleotides translated into protein from the retroviral genome shown in SEQ ID NO: 
141 may then be determined by cloning the corresponding cDNAs, predicting the open 
reading frames and cloning the appropriate cDNAs into a vector containing a viral 
promoter, such as T7. The resulting constructs can be employed in a translation 
reaction, using techniques known to those of skill in the art, to identify nucleotide 
sequences which result in expressed protein. Similarly, primers specific for the 
remaining breast tumor-specific polypeptides described herein may be designed based 
on the nucleotide sequences provided in SEQ ID NO: 11 - SEQ ID NO:86 and SEQ ID 
NO: 142 - SEQ ID NO:297. 
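
Basic properties of the primer pair given above (SEQ ID NO.:126 and SEQ ID NO.:127) can be computed with a few lines of Python. The sketch below is for illustration only; the helper names are hypothetical, and the calculation covers only length, GC fraction and reverse complement. It does not model annealing temperature or specificity.

```python
# Primer sequences as written in the text, with spacing removed.
PRIMER_126 = "ATG GCT ATT TTC GGG GGC TGA CA".replace(" ", "")
PRIMER_127 = "CCG GTA TCT CCT CGT GGG TAT T".replace(" ", "")

_COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence containing only A, C, G and T."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = seq.upper()
    return sum(1 for base in seq if base in "GC") / len(seq)


for name, primer in (("SEQ ID NO.:126", PRIMER_126), ("SEQ ID NO.:127", PRIMER_127)):
    print(name, len(primer), round(gc_fraction(primer), 2), reverse_complement(primer))
```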

Recombinant polypeptides encoded by the DNA sequences described 
above may be readily prepared from the DNA sequences. For example, supernatants 
from suitable host/vector systems which secrete recombinant protein or polypeptide into 
culture media may be first concentrated using a commercially available filter. 
Following concentration, the concentrate may be applied to a suitable purification 
matrix such as an affinity matrix or an ion exchange resin. Finally, one or more reverse 
phase HPLC steps can be employed to further purify a recombinant polypeptide. 

In general, any of a variety of expression vectors known to those of 
ordinary skill in the art may be employed to express recombinant polypeptides of this 
invention. Expression may be achieved in any appropriate host cell that has been 
transformed or transfected with an expression vector containing a DNA molecule that 
encodes a recombinant polypeptide. Suitable host cells include prokaryotes, yeast and 
higher eukaryotic cells. Preferably, the host cells employed are E. coli, yeast or a 
mammalian cell line such as COS or CHO. 

Such techniques may also be used to prepare polypeptides comprising 
epitopes or variants of the native polypeptides. For example, variants of a native 
polypeptide may generally be prepared using standard mutagenesis techniques, such as 
oligonucleotide-directed site-specific mutagenesis, and sections of the DNA sequence 
may be removed to permit preparation of truncated polypeptides. Portions and other 
variants having fewer than about 100 amino acids, and generally fewer than about 50 
amino acids, may also be generated by synthetic means, using techniques well known to 
those of ordinary skill in the art. For example, such polypeptides may be synthesized 
using any of the commercially available solid-phase techniques, such as the Merrifield 
solid-phase synthesis method, where amino acids are sequentially added to a growing 
amino acid chain. See Merrifield, J. Am. Chem. Soc. 85:2149-2154 (1963). Equipment 
for automated synthesis of polypeptides is commercially available from suppliers such 
as Perkin Elmer/Applied BioSystems Division, Foster City, CA, and may be operated 
according to the manufacturer's instructions. 

In specific embodiments, polypeptides of the present invention 
encompass amino acid sequences encoded by a DNA molecule having a sequence 
recited in any one of SEQ ID NO:1, 3-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 
178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 
250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 and 
291-297, variants of such polypeptides that are encoded by DNA molecules containing 
one or more nucleotide substitutions, deletions, insertions and/or modifications at no 
more than 20% of the nucleotide positions, and epitopes of the above polypeptides. 
Polypeptides within the scope of the present invention also include polypeptides (and 
epitopes thereof) encoded by DNA sequences that hybridize to a DNA molecule having 
a sequence recited in any one of SEQ ID NO:1, 3-26, 28-77, 142, 143, 146-152, 154- 
166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 
243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 
284, 288 and 291-297 under stringent conditions, wherein the DNA sequences are at 
least 80% identical in overall sequence to a recited sequence and wherein RNA 
corresponding to the nucleotide sequence is expressed at a greater level in human breast 
tumor tissue than in normal breast tissue. As used herein, "stringent conditions" refers 
to prewashing in a solution of 6X SSC, 0.2% SDS; hybridizing at 65°C, 6X SSC, 0.2% 
SDS overnight; followed by two washes of 30 minutes each in 1X SSC, 0.1% SDS at 
65°C and two washes of 30 minutes each in 0.2X SSC, 0.1% SDS at 65°C. DNA 
molecules according to the present invention include molecules that encode any of the 
above polypeptides. 
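
The stringent conditions defined above form a fixed sequence of steps, which can be recorded as structured data. The Python sketch below is illustrative: the class and variable names are hypothetical, durations and temperatures that the text does not give are left as None, and the NaCl figure relies on the standard laboratory definition of 1X SSC (0.15 M NaCl, 0.015 M sodium citrate), which is an outside assumption rather than something stated in this document.

```python
from dataclasses import dataclass
from typing import Optional

# Assumption from standard laboratory practice, not from this document:
# 1X SSC contains 0.15 M NaCl.
NACL_MOLAR_PER_1X_SSC = 0.15


@dataclass(frozen=True)
class Step:
    name: str
    ssc_x: float              # SSC concentration as a multiple of 1X
    sds_percent: float
    temp_c: Optional[float]   # None where the text gives no temperature
    minutes: Optional[int]    # None where the text gives no fixed duration
    repeats: int = 1


STRINGENT_CONDITIONS = [
    Step("prewash", 6.0, 0.2, None, None),
    Step("hybridization (overnight)", 6.0, 0.2, 65.0, None),
    Step("wash A", 1.0, 0.1, 65.0, 30, repeats=2),
    Step("wash B", 0.2, 0.1, 65.0, 30, repeats=2),
]


def total_timed_minutes(steps) -> int:
    """Sum the durations of all steps that have a fixed duration."""
    return sum(s.minutes * s.repeats for s in steps if s.minutes is not None)


def nacl_molar(step: Step) -> float:
    """Approximate NaCl molarity of a step's SSC buffer under the stated assumption."""
    return step.ssc_x * NACL_MOLAR_PER_1X_SSC


print(total_timed_minutes(STRINGENT_CONDITIONS))
for step in STRINGENT_CONDITIONS:
    print(step.name, round(nacl_molar(step), 3))
```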

In another aspect of the present invention, antibodies are provided. Such 
antibodies may be prepared by any of a variety of techniques known to those of 
ordinary skill in the art. See, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, 
Cold Spring Harbor Laboratory, 1988. In one such technique, an immunogen 
comprising the polypeptide is initially injected into any of a wide variety of mammals 
(e.g., mice, rats, rabbits, sheep or goats). In this step, the polypeptides of this invention 
may serve as the immunogen without modification. Alternatively, particularly for 
relatively short polypeptides, a superior immune response may be elicited if the 
polypeptide is joined to a carrier protein, such as bovine serum albumin or keyhole 
limpet hemocyanin. The immunogen is injected into the animal host, preferably 
according to a predetermined schedule incorporating one or more booster 
immunizations, and the animals are bled periodically. Polyclonal antibodies specific for 
the polypeptide may then be purified from such antisera by, for example, affinity 
chromatography using the polypeptide coupled to a suitable solid support. 

Monoclonal antibodies specific for the antigenic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur. J. 
Immunol. 6:511-519 (1976), and improvements thereto. Briefly, these methods involve 
the preparation of immortal cell lines capable of producing antibodies having the 
desired specificity (i.e., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal immunized as 
described above. The spleen cells are then immortalized by, for example, fusion with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
animal. A variety of fusion techniques may be employed. For example, the spleen cells 
and myeloma cells may be combined with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that supports the growth of hybrid 
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 
aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 
colonies of hybrids are observed. Single colonies are selected and their culture 
supernatants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supernatants of growing 
hybridoma colonies. In addition, various techniques may be employed to enhance the 
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable 
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 
the ascites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 
extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affinity chromatography step. 

Antibodies may be used, for example, in methods for detecting breast 
cancer in a patient. Such methods involve using an antibody to detect the presence or 
absence of a breast tumor-specific polypeptide as described herein in a suitable 
biological sample. As used herein, suitable biological samples include tumor or normal 
tissue biopsy, mastectomy, blood, lymph node, serum or urine samples, or other tissue, 
homogenate, or extract thereof obtained fi-om a patient. 
25 There are a variety of assay formats known to those of ordinary skill in 

the art for using an antibody to detect polypeptide markers in a sample. See, e.g., 
Hariow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 
1988. For example, the assay may be performed in a Western blot format, wherein 4^ 
protein preparation fi-om the biological sample is submitted to gel electrophoresis, 
30 transferred to a suitable membrane and allowed to react with the antibody. The 
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presence of the antibody on the membrane may then be detected using a suitable 
detection reagent, as described below. 

In another embodiment, the assay involves the use of antibody
immobilized on a solid support to bind to the polypeptide and remove it from the
remainder of the sample. The bound polypeptide may then be detected using a second
antibody or reagent that contains a reporter group. Alternatively, a competitive assay
may be utilized, in which a polypeptide is labeled with a reporter group and allowed to
bind to the immobilized antibody after incubation of the antibody with the sample. The
extent to which components of the sample inhibit the binding of the labeled polypeptide
to the antibody is indicative of the reactivity of the sample with the immobilized
antibody, and as a result, indicative of the concentration of polypeptide in the sample.

The solid support may be any material known to those of ordinary skill
in the art to which the antibody may be attached. For example, the solid support may be
a test well in a microtiter plate or a nitrocellulose filter or other suitable membrane.
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a
plastic material such as polystyrene or polyvinylchloride. The support may also be a
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S.
Patent No. 5,359,681.

The antibody may be immobilized on the solid support using a variety of
techniques known to those in the art, which are amply described in the patent and
scientific literature. In the context of the present invention, the term "immobilization"
refers to both noncovalent association, such as adsorption, and covalent attachment
(which may be a direct linkage between the antigen and functional groups on the
support or may be a linkage by way of a cross-linking agent). Immobilization by
adsorption to a well in a microtiter plate or to a membrane is preferred. In such cases,
adsorption may be achieved by contacting the antibody, in a suitable buffer, with the
solid support for a suitable amount of time. The contact time varies with temperature,
but is typically between about 1 hour and 1 day. In general, contacting a well of a
plastic microtiter plate (such as polystyrene or polyvinylchloride) with an amount of
antibody ranging from about 10 ng to about 1 µg, and preferably about 100-200 ng, is
sufficient to immobilize an adequate amount of polypeptide.

Covalent attachment of antibody to a solid support may also generally be
achieved by first reacting the support with a bifunctional reagent that will react with
both the support and a functional group, such as a hydroxyl or amino group, on the
antibody. For example, the antibody may be covalently attached to supports having an
appropriate polymer coating using benzoquinone or by condensation of an aldehyde
group on the support with an amine and an active hydrogen on the binding partner (see,
e.g., Pierce Immunotechnology Catalog and Handbook (1991) at A12-A13).

In certain embodiments, the assay for detection of polypeptide in a
sample is a two-antibody sandwich assay. This assay may be performed by first
contacting an antibody that has been immobilized on a solid support, commonly the
well of a microtiter plate, with the biological sample, such that polypeptides within
the sample are allowed to bind to the immobilized antibody. Unbound sample is then
removed from the immobilized polypeptide-antibody complexes and a second antibody
(containing a reporter group) capable of binding to a different site on the polypeptide is
added. The amount of second antibody that remains bound to the solid support is then
determined using a method appropriate for the specific reporter group.

More specifically, once the antibody is immobilized on the support as
described above, the remaining protein binding sites on the support are typically
blocked. Any suitable blocking agent known to those of ordinary skill in the art, such
as bovine serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO), may be
used. The immobilized antibody is then incubated with the sample, and polypeptide is
allowed to bind to the antibody. The sample may be diluted with a suitable diluent, such as
phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact
time (i.e., incubation time) is that period of time that is sufficient to detect the presence
of polypeptide within a sample obtained from an individual with breast cancer.
Preferably, the contact time is sufficient to achieve a level of binding that is at least
95% of that achieved at equilibrium between bound and unbound polypeptide. Those of
ordinary skill in the art will recognize that the time necessary to achieve equilibrium
may be readily determined by assaying the level of binding that occurs over a period of
time. At room temperature, an incubation time of about 30 minutes is generally
sufficient.
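
By way of illustration only, the determination of a contact time that reaches at least 95% of equilibrium binding may be sketched as follows. The function name, the sampling times and the signal values are hypothetical, and the sketch assumes that the largest reading in the time course approximates the equilibrium level.

```python
def time_to_fraction_of_plateau(times_min, signals, fraction=0.95):
    """Return the first sampled time at which the binding signal reaches
    `fraction` of the plateau.

    Assumption (not from the specification): the plateau is approximated
    by the largest signal observed in the time course.
    """
    if len(times_min) != len(signals) or not signals:
        raise ValueError("times and signals must be non-empty and equal in length")
    plateau = max(signals)
    threshold = fraction * plateau
    for t, s in zip(times_min, signals):
        if s >= threshold:
            return t
    return None  # only reachable if fraction > 1


# Hypothetical time course: minutes of incubation vs. relative bound signal.
times = [5, 10, 20, 30, 60, 120]
bound = [0.30, 0.55, 0.80, 0.96, 1.00, 1.00]
contact_time = time_to_fraction_of_plateau(times, bound)
```

With these illustrative readings the 95% level is first reached at the 30 minute time point, which agrees with the incubation time stated above.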

Unbound sample may then be removed by washing the solid support
with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second
antibody, which contains a reporter group, may then be added to the solid support.
Preferred reporter groups include enzymes (such as horseradish peroxidase), substrates,
cofactors, inhibitors, dyes, radionuclides, luminescent groups, fluorescent groups and
biotin. The conjugation of antibody to reporter group may be achieved using standard
methods known to those of ordinary skill in the art.

The second antibody is then incubated with the immobilized antibody-
polypeptide complex for an amount of time sufficient to detect the bound polypeptide.
An appropriate amount of time may generally be determined by assaying the level of
binding that occurs over a period of time. Unbound second antibody is then removed
and bound second antibody is detected using the reporter group. The method employed
for detecting the reporter group depends upon the nature of the reporter group. For
radioactive groups, scintillation counting or autoradiographic methods are generally
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups
and fluorescent groups. Biotin may be detected using avidin, coupled to a different
reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme
reporter groups may generally be detected by the addition of substrate (generally for a
specific period of time), followed by spectroscopic or other analysis of the reaction
products.

To determine the presence or absence of breast cancer, the signal
detected from the reporter group that remains bound to the solid support is generally
compared to a signal that corresponds to a predetermined cut-off value established from
non-tumor tissue. In one preferred embodiment, the cut-off value is the average mean
signal obtained when the immobilized antibody is incubated with samples from patients
without breast cancer. In general, a sample generating a signal that is three standard
deviations above the predetermined cut-off value may be considered positive for breast
cancer. In an alternate preferred embodiment, the cut-off value is determined using a
Receiver Operator Curve, according to the method of Sackett et al., Clinical
Epidemiology: A Basic Science for Clinical Medicine, p. 106-7 (Little Brown and Co.,
1985). Briefly, in this embodiment, the cut-off value may be determined from a plot of
pairs of true positive rates (i.e., sensitivity) and false positive rates (100%-specificity)
that correspond to each possible cut-off value for the diagnostic test result. The cut-off
value on the plot that is the closest to the upper left-hand corner (i.e., the value that
encloses the largest area) is the most accurate cut-off value, and a sample generating a
signal that is higher than the cut-off value determined by this method may be considered
positive. Alternatively, the cut-off value may be shifted to the left along the plot, to
minimize the false positive rate, or to the right, to minimize the false negative rate. In
general, a sample generating a signal that is higher than the cut-off value determined by
this method is considered positive for breast cancer.
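
The two cut-off embodiments described above may be illustrated by the following minimal sketch. It is not the procedure of Sackett et al. as published; the function names and the signal values are hypothetical, the sample standard deviation is an assumed choice, and the candidate cut-offs are simply the observed signal values.

```python
import math
import statistics


def mean_plus_3sd_threshold(negative_signals):
    """Signal level three standard deviations above the mean signal of
    samples from patients without breast cancer.

    Assumption: the sample (n - 1) standard deviation is used.
    """
    mean = statistics.mean(negative_signals)
    sd = statistics.stdev(negative_signals)
    return mean + 3 * sd


def roc_cutoff(negative_signals, positive_signals):
    """Candidate cut-off whose (false positive rate, true positive rate)
    point lies closest to the upper left-hand corner (0, 1) of the plot.

    A sample is called positive when its signal is higher than the cut-off.
    """
    candidates = sorted(set(negative_signals) | set(positive_signals))
    best_cutoff, best_distance = None, None
    for cutoff in candidates:
        tpr = sum(s > cutoff for s in positive_signals) / len(positive_signals)
        fpr = sum(s > cutoff for s in negative_signals) / len(negative_signals)
        distance = math.hypot(fpr, 1.0 - tpr)
        if best_distance is None or distance < best_distance:
            best_cutoff, best_distance = cutoff, distance
    return best_cutoff


# Hypothetical absorbance readings.
negatives = [0.10, 0.12, 0.11, 0.09, 0.13]
positives = [0.40, 0.55, 0.35, 0.14, 0.60]
threshold_3sd = mean_plus_3sd_threshold(negatives)
threshold_roc = roc_cutoff(negatives, positives)
```

Shifting the selected cut-off toward a lower false positive rate or a lower false negative rate, as described above, would correspond to choosing a neighboring candidate rather than the closest point.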

In a related embodiment, the assay is performed in a flow-through or
strip test format, wherein the antibody is immobilized on a membrane, such as
nitrocellulose. In the flow-through test, polypeptide within the sample binds to the
immobilized antibody as the sample passes through the membrane. A second, labeled
antibody then binds to the antibody-polypeptide complex as a solution containing the
second antibody flows through the membrane. The detection of bound second antibody
may then be performed as described above. In the strip test format, one end of the
membrane to which antibody is bound is immersed in a solution containing the sample.
The sample migrates along the membrane through a region containing second antibody
and to the area of immobilized antibody. Concentration of second antibody at the area
of immobilized antibody indicates the presence of breast cancer. Typically, the
concentration of second antibody at that site generates a pattern, such as a line, that can
be read visually. The absence of such a pattern indicates a negative result. In general,
the amount of antibody immobilized on the membrane is selected to generate a visually
discernible pattern when the biological sample contains a level of polypeptide that
would be sufficient to generate a positive signal in the two-antibody sandwich assay, in
the format discussed above. Preferably, the amount of antibody immobilized on the
membrane ranges from about 25 ng to about 1 µg, and more preferably from about 50
ng to about 1 µg. Such tests can typically be performed with a very small amount of
biological sample.

The presence or absence of breast cancer in a patient may also be
determined by evaluating the level of mRNA encoding a breast tumor-specific
polypeptide as described herein within the biological sample (e.g., a biopsy,
mastectomy and/or blood sample from a patient) relative to a predetermined cut-off
value. Such an evaluation may be achieved using any of a variety of methods known to
those of ordinary skill in the art such as, for example, in situ hybridization and
amplification by polymerase chain reaction.

For example, polymerase chain reaction may be used to amplify
sequences from cDNA prepared from RNA that is isolated from one of the above
biological samples. Sequence-specific primers for use in such amplification may be
designed based on the sequences provided in any one of SEQ ID NO: 1, 11-86 and
142-297, and may be purchased or synthesized. In the case of B18Ag1, as noted herein, one
suitable primer pair is B18Ag1-2 (5'ATG GCT ATT TTC GGG GGC TGA CA) (SEQ
ID NO.:126) and B18Ag1-3 (5'CCG GTA TCT CCT CGT GGG TAT T) (SEQ ID
NO.:127). The PCR reaction products may then be separated by gel electrophoresis and
visualized according to methods well known to those of ordinary skill in the art.
Amplification is typically performed on samples obtained from matched pairs of tissue
(tumor and non-tumor tissue from the same individual) or from unmatched pairs of
tissue (tumor and non-tumor tissue from different individuals). The amplification
reaction is preferably performed on several dilutions of cDNA spanning two orders of
magnitude. A two-fold or greater increase in expression in several dilutions of the
tumor sample as compared to the same dilution of the non-tumor sample is considered
positive.
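
The dilution-matched comparison described above may be sketched as follows. The function name and band intensities are hypothetical, and "several dilutions" is interpreted here, as an assumption, to mean at least two dilutions.

```python
def is_expression_positive(tumor_by_dilution, normal_by_dilution,
                           fold=2.0, min_dilutions=2):
    """Compare tumor and non-tumor signals at the same cDNA dilution.

    Returns True when at least `min_dilutions` dilutions show a `fold`-fold
    or greater increase in the tumor sample. The value of `min_dilutions`
    is an assumed reading of "several dilutions".
    """
    hits = 0
    for dilution, tumor_signal in tumor_by_dilution.items():
        normal_signal = normal_by_dilution.get(dilution)
        if normal_signal is None:
            continue  # no matched non-tumor measurement at this dilution
        if normal_signal == 0:
            # Any detectable tumor signal over an undetectable control counts.
            if tumor_signal > 0:
                hits += 1
        elif tumor_signal / normal_signal >= fold:
            hits += 1
    return hits >= min_dilutions


# Hypothetical band intensities keyed by dilution factor (1x, 10x, 100x),
# spanning two orders of magnitude.
tumor = {1: 900.0, 10: 400.0, 100: 50.0}
normal = {1: 800.0, 10: 150.0, 100: 20.0}
result = is_expression_positive(tumor, normal)
```

In this illustration the undiluted reactions differ by less than two-fold, which may reflect saturation, while the two higher dilutions each show more than a two-fold increase, so the sample is scored positive.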

As used herein, the term "primer/probe specific for a DNA/RNA
molecule" means an oligonucleotide sequence that has at least about 80% identity,
preferably at least about 90% identity, and more preferably at least about 95% identity,
to the DNA/RNA molecule in question. Primers and/or probes which may be usefully
employed in the inventive diagnostic methods preferably have at least about 10-40
nucleotides. In a preferred embodiment, the polymerase chain reaction primers
comprise at least about 10 contiguous nucleotides of a DNA/RNA molecule encoding
one of the polypeptides disclosed herein. Preferably, oligonucleotide probes for use in
the inventive diagnostic methods comprise at least about 15 contiguous
nucleotides of a DNA/RNA molecule encoding one of the polypeptides disclosed
herein. Techniques for both PCR-based assays and in situ hybridization assays are well
known in the art.
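
The identity thresholds in the above definition may be illustrated by a minimal sketch. The function names and the example sequences are hypothetical. The sketch assumes an ungapped comparison against the best-matching window of a single strand, which is a simplification of how identity would be assessed in practice, and it treats the "about" thresholds as exact values.

```python
def percent_identity(oligo, target):
    """Highest ungapped percent identity of `oligo` against any window of
    `target` of the same length (single strand only; an assumption)."""
    oligo = oligo.upper().replace(" ", "")
    target = target.upper().replace(" ", "")
    if not oligo or len(oligo) > len(target):
        raise ValueError("oligo must be non-empty and no longer than target")
    best_matches = 0
    for start in range(len(target) - len(oligo) + 1):
        window = target[start:start + len(oligo)]
        matches = sum(a == b for a, b in zip(oligo, window))
        best_matches = max(best_matches, matches)
    return 100.0 * best_matches / len(oligo)


def specificity_tier(identity):
    """Map a percent identity onto the tiers named in the definition."""
    if identity >= 95.0:
        return "more preferred"
    if identity >= 90.0:
        return "preferred"
    if identity >= 80.0:
        return "specific"
    return "not specific"


# Hypothetical 10-mer with one mismatch against a hypothetical target.
identity = percent_identity("ACGTACGTAA", "TTTACGTACGTACGGG")
tier = specificity_tier(identity)
```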

Conventional RT-PCR protocols using agarose and ethidium bromide
staining, while important in defining gene specificity, do not lend themselves to
diagnostic kit development because of the time and effort required in making them
quantitative (i.e., construction of saturation and/or titration curves), and their sample
throughput. This problem is overcome by the development of procedures such as real
time RT-PCR, which allows for assays to be performed in single tubes, and in turn can
be modified for use in 96 well plate formats. Instrumentation to perform such
methodologies is available from Perkin Elmer/Applied Biosystems Division.
Alternatively, other high throughput assays using labeled probes (e.g., digoxygenin) in
combination with labeled (e.g., enzyme, fluorescent, radioactive) antibodies to such
probes can also be used in the development of 96 well plate assays.

In another method for determining the presence or absence of breast
cancer in a patient, one or more of the breast tumor-specific polypeptides described
may be used in a skin test. As used herein, a "skin test" is any assay performed directly
on a patient in which a delayed-type hypersensitivity (DTH) reaction (such as swelling,
reddening or dermatitis) is measured following intradermal injection of one or more
polypeptides as described above. Such injection may be achieved using any suitable
device sufficient to contact the polypeptide or polypeptides with dermal cells of the
patient, such as a tuberculin syringe or 1 mL syringe. Preferably, the reaction is
measured at least 48 hours after injection, more preferably 48-72 hours.

The DTH reaction is a cell-mediated immune response, which is greater
in patients that have been exposed previously to a test antigen (i.e., an immunogenic
portion of a polypeptide employed, or a variant thereof). The response may be measured
visually, using a ruler. In general, a response that is greater than about 0.5 cm in
diameter, preferably greater than about 5.0 cm in diameter, is a positive response,
indicative of breast cancer.

The breast tumor-specific polypeptides described herein are preferably
formulated, for use in a skin test, as pharmaceutical compositions containing at least
one polypeptide and a physiologically acceptable carrier, such as water, saline, alcohol,
or a buffer. Such compositions typically contain one or more of the above polypeptides
in an amount ranging from about 1 µg to 100 µg, preferably from about 10 µg to 50 µg
in a volume of 0.1 mL. Preferably, the carrier employed in such pharmaceutical
compositions is a saline solution with appropriate preservatives, such as phenol and/or
Tween 80™.

In other aspects of the present invention, the progression and/or response
to treatment of a breast cancer may be monitored by performing any of the above assays
over a period of time, and evaluating the change in the level of the response (i.e., the
amount of polypeptide or mRNA detected or, in the case of a skin test, the extent of the
immune response detected). For example, the assays may be performed every month to
every other month for a period of 1 to 2 years. In general, breast cancer is progressing
in those patients in whom the level of the response increases over time. In contrast,
breast cancer is not progressing when the signal detected either remains constant or
decreases with time.
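
One way to summarize such serial measurements, offered only as an illustrative sketch, is the slope of a least-squares line through the response levels. The function name, the use of a fitted slope, the tolerance parameter and the data are assumptions that are not part of the described method.

```python
def response_trend(days, levels, tolerance=0.0):
    """Classify serial assay results by the least-squares slope of
    response level against time.

    A slope above `tolerance` is read as an increasing response
    ("progressing"); a constant or decreasing response is read as
    "not progressing". The slope criterion is an assumed simplification.
    """
    n = len(days)
    if n < 2 or n != len(levels):
        raise ValueError("need at least two paired measurements")
    mean_x = sum(days) / n
    mean_y = sum(levels) / n
    sxx = sum((x - mean_x) ** 2 for x in days)
    if sxx == 0:
        raise ValueError("measurements must be taken at different times")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(days, levels))
    slope = sxy / sxx
    return "progressing" if slope > tolerance else "not progressing"


# Hypothetical monthly measurements (day of assay, relative response level).
trend = response_trend([0, 30, 60, 90], [1.0, 1.2, 1.5, 1.9])
```

A non-zero tolerance could be chosen to reflect assay-to-assay variability, so that small fluctuations are not read as progression.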

In further aspects of the present invention, the compounds described
herein may be used for the immunotherapy of breast cancer. In these aspects, the
compounds (which may be polypeptides, antibodies or nucleic acid molecules) are
preferably incorporated into pharmaceutical compositions or vaccines. Pharmaceutical
compositions comprise one or more such compounds and a physiologically acceptable
carrier. Vaccines may comprise one or more polypeptides and an immune response
enhancer, such as an adjuvant or a liposome (into which the compound is incorporated).
Pharmaceutical compositions and vaccines may additionally contain a delivery system,
such as biodegradable microspheres which are disclosed, for example, in U.S. Patent



wo 98/45328 

PCT/US98/06939 

Nos. 4,897,268 and 5,075,109. Pharmaceutical compositions and vaccines within the
scope of the present invention may also contain other compounds, including one or
more separate polypeptides.

Alternatively, a vaccine may contain DNA encoding one or more of the
polypeptides as described above, such that the polypeptide is generated in situ. In such
vaccines, the DNA may be present within any of a variety of delivery systems known to
those of ordinary skill in the art, including nucleic acid expression systems, bacteria and
viral expression systems. Appropriate nucleic acid expression systems contain the
necessary DNA sequences for expression in the patient (such as a suitable promoter and
terminating signal). Bacterial delivery systems involve the administration of a
bacterium (such as Bacillus Calmette-Guerin) that expresses an immunogenic portion
of the polypeptide on its cell surface. In a preferred embodiment, the DNA may be
introduced using a viral expression system (e.g., vaccinia or other pox virus, retrovirus,
or adenovirus), which may involve the use of a non-pathogenic (defective), replication
competent virus. Techniques for incorporating DNA into such expression systems are
well known to those of ordinary skill in the art. The DNA may also be "naked," as
described, for example, in Ulmer et al., Science 259:1745-1749 (1993), and reviewed by
Cohen, Science 259:1691-1692 (1993). The uptake of naked DNA may be increased by
coating the DNA onto biodegradable beads, which are efficiently transported into the
cells.

While any suitable carrier known to those of ordinary skill in the art may
be employed in the pharmaceutical compositions of this invention, the type of carrier
will vary depending on the mode of administration. For parenteral administration, such
as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a fat, a
wax or a buffer. For oral administration, any of the above carriers or a solid carrier,
such as mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum,
cellulose, glucose, sucrose, and magnesium carbonate, may be employed.
Biodegradable microspheres (e.g., polylactate polyglycolate) may also be employed as
carriers for the pharmaceutical compositions of this invention.

Any of a variety of adjuvants may be employed in the vaccines of this
invention to nonspecifically enhance the immune response. Most adjuvants contain a
substance designed to protect the antigen from rapid catabolism, such as aluminum
hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A,
Bordetella pertussis or Mycobacterium tuberculosis derived proteins. Suitable
adjuvants are commercially available as, for example, Freund's Incomplete Adjuvant
and Complete Adjuvant (Difco Laboratories, Detroit, MI), Merck Adjuvant 65 (Merck
and Company, Inc., Rahway, NJ), alum, biodegradable microspheres, monophosphoryl
lipid A and quil A. Cytokines, such as GM-CSF or interleukin-2, -7, or -12, may also
be used as adjuvants.

The above pharmaceutical compositions and vaccines may be used, for
example, for the therapy of breast cancer in a patient. As used herein, a "patient" refers
to any warm-blooded animal, preferably a human. A patient may or may not be
afflicted with breast cancer. Accordingly, the above pharmaceutical compositions and
vaccines may be used to prevent the development of breast cancer or to treat a patient
afflicted with breast cancer. To prevent the development of breast cancer, a
pharmaceutical composition or vaccine comprising one or more polypeptides as
described herein may be administered to a patient. Alternatively, naked DNA or
plasmid or viral vector encoding the polypeptide may be administered. For treating a
patient with breast cancer, the pharmaceutical composition or vaccine may comprise
one or more polypeptides, antibodies or nucleotide sequences complementary to DNA
encoding a polypeptide as described herein (e.g., antisense RNA or antisense
deoxyribonucleotide oligonucleotides).

Routes and frequency of administration, as well as dosage, will vary
from individual to individual. In general, the pharmaceutical compositions and
vaccines may be administered by injection (e.g., intracutaneous, intramuscular,
intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. Between 1 and
10 doses may be administered for a 52-week period. Preferably, 6 doses are
administered, at intervals of 1 month, and booster vaccinations may be given
periodically thereafter. Alternate protocols may be appropriate for individual patients.
A suitable dose is an amount of a compound that, when administered as described
above, is capable of promoting an anti-tumor immune response. Such response can be
monitored by measuring the anti-tumor antibodies in a patient or by vaccine-dependent
generation of cytolytic effector cells capable of killing the patient's tumor cells in vitro.
Such vaccines should also be capable of causing an immune response that leads to an
improved clinical outcome (e.g., more frequent remissions, complete or partial, or longer
disease-free survival) in vaccinated patients as compared to non-vaccinated patients. In
general, for pharmaceutical compositions and vaccines comprising one or more
polypeptides, the amount of each polypeptide present in a dose ranges from about
100 µg to 5 mg. Suitable dose sizes will vary with the size of the patient, but will
typically range from about 0.1 mL to about 5 mL.



The following Examples are offered by way of illustration and not by
way of limitation.

EXAMPLES

EXAMPLE 1

Preparation of Breast Tumor-Specific cDNAs Using
Differential Display RT-PCR

This Example illustrates the preparation of cDNA molecules encoding
breast tumor-specific polypeptides using a differential display screen.

A. Preparation of B18Ag1 cDNA and Characterization of mRNA Expression

Tissue samples were prepared from breast tumor and normal tissue of a
patient with breast cancer that was confirmed by pathology after removal from the
patient. Normal RNA and tumor RNA were extracted from the samples and mRNA was
isolated and converted into cDNA using a (dT)12AG (SEQ ID NO.: 130) anchored 3'
primer. Differential display PCR was then executed using a randomly chosen primer
(CTTCAACCTC) (SEQ ID NO.: 103). Amplification conditions were standard buffer
containing 1.5 mM MgCl2, 20 pmol of primer, 500 pmol dNTP, and 1 unit of Taq DNA
polymerase (Perkin-Elmer, Branchburg, NJ). Forty cycles of amplification were
performed using 94°C denaturation for 30 seconds, 42°C annealing for 1 minute, and
72°C extension for 30 seconds. An RNA fingerprint containing 76 amplified products
was obtained. Although the RNA fingerprint of breast tumor tissue was over 98%
identical to that of the normal breast tissue, a band was repeatedly observed to be
specific to the RNA fingerprint pattern of the tumor. This band was cut out of a silver
stained gel, subcloned into the T-vector (Novagen, Madison, WI) and sequenced.

The sequence of the cDNA, referred to as B18Ag1, is provided in SEQ
ID NO:1. A database search of GENBANK and EMBL revealed that the B18Ag1
fragment initially cloned is 77% identical to the endogenous human retroviral element
S71, which is a truncated retroviral element homologous to the Simian Sarcoma Virus
(SSV). S71 contains an incomplete gag gene, a portion of the pol gene and an LTR-like
structure at the 3' terminus (see Werner et al., Virology 174:225-238 (1990)). B18Ag1
is also 64% identical to SSV in the region corresponding to the P30 (gag) locus.
B18Ag1 contains three separate and incomplete reading frames covering a region which
shares considerable homology to a wide variety of gag proteins of retroviruses which
infect mammals. In addition, the homology to S71 is not just within the gag gene, but
spans several kb of sequence including an LTR.

B18Ag1-specific PCR primers were synthesized using computer analysis
guidelines. RT-PCR amplification (94°C, 30 seconds; 60°C to 42°C, 30 seconds; 72°C,
30 seconds for 40 cycles) confirmed that B18Ag1 represents an actual mRNA sequence
present at relatively high levels in the patient's breast tumor tissue. The primers used
in amplification were B18Ag1-1 (CTG CCT GAG CCA CAA ATG) (SEQ ID NO.:128)
and B18Ag1-4 (CCG GAG GAG GAA GCT AGA GGA ATA) (SEQ ID NO.:129) at a
3.5 mM magnesium concentration and a pH of 8.5, and B18Ag1-2 (ATG GCT ATT
TTC GGG GCC TGA CA) (SEQ ID NO.:126) and B18Ag1-3 (CCG GTA TCT CCT
CGT GGG TAT T) (SEQ ID NO.:127) at 2 mM magnesium at pH 9.5. The same
experiments showed exceedingly low to nonexistent levels of expression in this
patient's normal breast tissue (see Figure 1). RT-PCR experiments were then used to
show that B18Ag1 mRNA is present in nine other breast tumor samples (from Brazilian
and American patients) but absent in, or at exceedingly low levels in, the normal breast
tissue corresponding to each cancer patient. RT-PCR analysis has also shown that the
B18Ag1 transcript is not present in various normal tissues (including lymph node,
myocardium and liver) and present at relatively low levels in PBMC and lung tissue.
The presence of B18Ag1 mRNA in breast tumor samples, and its absence from normal
breast tissue, has been confirmed by Northern blot analysis, as shown in Figure 2.

The differential expression of B18Ag1 in breast tumor tissue was also
confirmed by RNase protection assays. Figure 3 shows the level of B18Ag1 mRNA in
various tissue types as determined in four different RNase protection assays. Lanes 1-
12 represent various normal breast tissue samples; lanes 13-25 represent various breast
tumor samples; lanes 26-27 represent normal prostate samples; lanes 28-29 represent
prostate tumor samples; lanes 30-32 represent colon tumor samples; lane 33 represents
normal aorta; lane 34 represents normal small intestine; lane 35 represents normal skin;
lane 36 represents normal lymph node; lane 37 represents normal ovary; lane 38
represents normal liver; lane 39 represents normal skeletal muscle; lane 40 represents a
first normal stomach sample; lane 41 represents a second normal stomach sample; lane
42 represents a normal lung; lane 43 represents normal kidney; and lane 44 represents
normal pancreas. Interexperimental comparison was facilitated by including a positive
control RNA of known β-actin message abundance in each assay and normalizing the
results of the different assays with respect to this positive control.
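
The normalization to a positive control may be sketched as follows. This is an illustrative assumption about the arithmetic (each sample signal divided by the control signal from the same assay); the function name, data layout and values are hypothetical and are not taken from Figure 3.

```python
def normalize_to_control(assays):
    """Express each sample signal relative to the positive control signal
    measured in the same assay, so that results from different assays can
    be compared on one scale.

    `assays` is a list of dicts of the form
    {"control": float, "samples": {lane_name: signal}}.
    """
    normalized = {}
    for assay in assays:
        control = assay["control"]
        if control <= 0:
            raise ValueError("positive control signal must be greater than zero")
        for lane, signal in assay["samples"].items():
            normalized[lane] = signal / control
    return normalized


# Hypothetical raw signals from two separate assays.
assays = [
    {"control": 200.0, "samples": {"lane 1": 10.0, "lane 13": 400.0}},
    {"control": 100.0, "samples": {"lane 26": 5.0}},
]
relative = normalize_to_control(assays)
```

In this illustration, lanes 1 and 26 come from assays with different control intensities but yield the same relative level once normalized.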

RT-PCR and Southern Blot analysis have shown the B18Ag1 locus to be
present in human genomic DNA as a single copy endogenous retroviral element. A
genomic clone of approximately 12-18 kb was isolated using the initial B18Ag1
sequence as a probe. Four additional subclones were also isolated by XbaI digestion.
Additional retroviral sequences obtained from the ends of the XbaI digests of these
clones (located as shown in Figure 4) are shown as SEQ ID NO:3 - SEQ ID NO:10,
where SEQ ID NO:3 shows the location of the sequence labeled 10 in Figure 4, SEQ ID
NO:4 shows the location of the sequence labeled 11-29, SEQ ID NO:5 shows the
location of the sequence labeled 3, SEQ ID NO:6 shows the location of the sequence
labeled 6, SEQ ID NO:7 shows the location of the sequence labeled 12, SEQ ID NO:8
shows the location of the sequence labeled 13, SEQ ID NO:9 shows the location of the
sequence labeled 14 and SEQ ID NO:10 shows the location of the sequence labeled
11-22.

Subsequent studies demonstrated that the 12-18 kb genomic clone
contains a retroviral element of about 7.75 kb, as shown in Figures 5A and 5B. The
sequence of this retroviral element is shown in SEQ ID NO:141. The numbered line at
the top of Figure 5A represents the sense strand sequence of the retroviral genomic
clone. The box below this line shows the position of selected restriction sites. The
arrows depict the different overlapping clones used to sequence the retroviral element.
The direction of the arrow shows whether the single-pass subclone sequence
corresponded to the sense or anti-sense strand. Figure 5B is a schematic diagram of the
retroviral element containing B18Ag1 depicting the organization of viral genes within
the element. The open boxes correspond to predicted reading frames, starting with a
methionine, found throughout the element. Each of the six likely reading frames is
shown, as indicated to the left of the boxes, with frames 1-3 corresponding to those
found on the sense strand.

Using the cDNA of SEQ ID NO:1 as a probe, a longer cDNA was
obtained (SEQ ID NO:227) which contains minor nucleotide differences (less than 1%)
compared to the genomic sequence shown in SEQ ID NO:141.

B. Preparation of cDNA Molecules Encoding Other Breast Tumor-Specific
Polypeptides

Normal RNA and tumor RNA were prepared and mRNA was isolated and
converted into cDNA using a (dT)12AG anchored 3' primer, as described above.
Differential display PCR was then executed using the randomly chosen primers SEQ
ID NO.: 87-125. Amplification conditions were as noted above, and bands observed to
be specific to the RNA fingerprint pattern of the tumor were cut out of a silver stained
gel, subcloned into either the T-vector (Novagen, Madison, WI) or the pCRII vector
(Invitrogen, San Diego, CA) and sequenced. The sequences are provided in SEQ ID
NO:11 - SEQ ID NO:86. Of the 79 sequences isolated, 67 were found to be novel (SEQ
ID NO.:11-26 and 28-77) (see also Figures 6-20).

An extended DNA sequence (SEQ ID NO: 290) for the antigen B15Ag1
(originally identified partial sequence provided in SEQ ID NO: 27) was obtained in
further studies. Comparison of the sequence of SEQ ID NO: 290 with those in the gene
bank, as described above, revealed homology to the known human β-A activin gene.

Subsequent studies identified an additional 146 sequences (SEQ ID
NOS:142-289), of which 115 appeared to be novel (SEQ ID NOS:142, 143, 146-152,
154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219,
221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278,
280, 281, 284, 288 and 291). To the best of the inventors' knowledge, none of the
previously identified sequences have heretofore been shown to be expressed at a greater
level in human breast tumor tissue than in normal breast tissue.

In further studies, six different splice forms of the antigen B11Ag1 were
isolated, with each of the various splice forms containing slightly different versions of
the B11Ag1 coding frame. Splice junction sequences define individual exons which, in
various patterns and arrangements, make up the various splice forms. Primers were
designed to examine the expression pattern of each of the exons using RT-PCR as
described below. Each exon was found to show the same expression pattern as the
original B11Ag1 clone, with expression being breast tumor-, prostate- and testis-specific.
The determined cDNA sequences for the isolated protein coding exons are provided in
SEQ ID NOS:292-297, respectively.

EXAMPLE 2 

Preparation of B18Ag1 DNA from Human Genomic DNA

This Example illustrates the preparation of B18Ag1 DNA by
amplification from human genomic DNA.

B18Ag1 DNA may be prepared from 250 ng human genomic DNA
using 20 pmol of B18Ag1-specific primers, 500 pmol dNTPs and 1 unit of Taq DNA
polymerase (Perkin Elmer, Branchburg, NJ) using the following amplification
parameters: denaturing at 94°C for 30 seconds, touchdown annealing for 30 seconds
from 60°C to 42°C in 2°C steps every two cycles, and extension at 72°C for 30 seconds.
The last step (a 42°C annealing temperature) should cycle 25 times. Primers were
selected using computer analysis. Primers synthesized were B18Ag1-1, B18Ag1-2,
B18Ag1-3, and B18Ag1-4. Primer pairs that may be used are 1+3, 1+4, 2+3, and 2+4.

Following gel electrophoresis, the band corresponding to B18Ag1 DNA
may be excised and cloned into a suitable vector.
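
By way of illustration only, the touchdown annealing schedule described above may be tabulated with a short Python sketch. The function name `touchdown_schedule` and its parameters are hypothetical, and the sketch assumes one reading of the text: two cycles at each annealing temperature from 60°C down to 44°C, followed by 25 cycles at 42°C.

```python
def touchdown_schedule(start_c=60, end_c=42, step_c=2,
                       cycles_per_step=2, final_cycles=25):
    """Return a list of (annealing_temp_C, n_cycles) tuples.

    Assumed reading of the Example: the annealing temperature drops by
    step_c every cycles_per_step cycles until end_c is reached, and the
    final temperature is then held for final_cycles cycles.
    """
    if step_c <= 0 or start_c < end_c:
        raise ValueError("expected start_c >= end_c and step_c > 0")
    schedule = []
    temp = start_c
    while temp > end_c:
        schedule.append((temp, cycles_per_step))
        temp -= step_c
    schedule.append((end_c, final_cycles))
    return schedule


schedule = touchdown_schedule()
total_cycles = sum(n for _, n in schedule)

for temp, n in schedule:
    # Denaturing (94 C, 30 s) and extension (72 C, 30 s) are the same in every cycle.
    print(f"{n:2d} cycles: 94C 30s / {temp}C 30s / 72C 30s")
print("total cycles:", total_cycles)
```

Under this reading the program comprises ten annealing stages; a different reading of "every two cycles" would change only the default arguments.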

EXAMPLE 3

Preparation of B18Ag1 DNA from Breast Tumor cDNA

This Example illustrates the preparation of B18Ag1 DNA by
amplification from human breast tumor cDNA.

First strand cDNA is synthesized from RNA prepared from human breast
tumor tissue in a reaction mixture containing 500 ng poly A+ RNA, 200 pmol of the
primer (T)12AG (TTT TTT TTT TTT AG) (SEQ ID NO: 130), 1X first strand
reverse transcriptase buffer, 6.7 mM DTT, 500 mmol dNTPs, and 1 unit AMV or
MMLV reverse transcriptase (from any supplier, such as Gibco-BRL (Grand Island,
NY)) in a final volume of 30 µl. After first strand synthesis, the cDNA is diluted
approximately 25-fold and 1 µl is used for amplification as described in Example 2.
While some primer pairs can result in a heterogeneous population of transcripts, the
primers B18Ag1-2 (5'ATG GCT ATT TTC GGG GGC TGA CA) (SEQ ID NO: 126)
and B18Ag1-3 (5'CCG GTA TCT CCT CGT GGG TAT T) (SEQ ID NO: 127) yield a
single 151 bp amplification product.
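
As a non-limiting illustration, basic properties of the two primers recited above may be computed as follows. The helper names are hypothetical, and the melting temperature uses the Wallace rule of thumb (2°C per A or T, 4°C per G or C), which is an assumption introduced here for illustration and is only a rough estimate for oligonucleotides of this length; it is not a method stated in this Example.

```python
# Primer sequences as recited in Example 3 (SEQ ID NOS: 126 and 127).
PRIMERS = {
    "B18Ag1-2": "ATG GCT ATT TTC GGG GGC TGA CA",
    "B18Ag1-3": "CCG GTA TCT CCT CGT GGG TAT T",
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def clean(seq):
    """Remove the spacing used in the printed sequence and upper-case it."""
    return seq.replace(" ", "").upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the primer."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def wallace_tm(seq):
    """Rule-of-thumb melting temperature in degrees C (assumption, see lead-in)."""
    s = clean(seq)
    return 2 * (s.count("A") + s.count("T")) + 4 * (s.count("G") + s.count("C"))


def reverse_complement(seq):
    """Reverse complement, i.e. the strand to which the primer anneals."""
    return clean(seq).translate(COMPLEMENT)[::-1]


for name, seq in PRIMERS.items():
    print(name, len(clean(seq)), "nt",
          f"GC={gc_fraction(seq):.2f}", f"Tm~{wallace_tm(seq)}C",
          "revcomp:", reverse_complement(seq))
```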

EXAMPLE 4 

IDENTIFICATION OF B-CELL AND T-CELL EPITOPES OF B18Ag1

This Example illustrates the identification of B18Ag1 epitopes.

The B18Ag1 sequence can be screened using a variety of computer
algorithms. To determine B-cell epitopes, the sequence can be screened for
hydrophobicity and hydrophilicity values using the method of Hopp, Prog. Clin. Biol.
Res. 172B:361.11 (1985) or, alternatively, Cease et al., J. Exp. Med. 164:1779-84
(1986) or Spouge et al., J. Immunol. 138:204-12 (1987). Additional Class II MHC
(antibody or B-cell) epitopes can be predicted using programs such as AMPHI (e.g.,
Margalit et al., J. Immunol. 138:2213 (1987)) or the methods of Rothbard and Taylor
(e.g., EMBO J. 7:93 (1988)).
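
By way of example only, a hydrophilicity screen of the kind referred to above may be sketched as a sliding-window average over the protein sequence. The sketch below assumes the widely cited Hopp-Woods hydrophilicity scale and a six-residue window; the scale values are transcribed from memory of that published scale and should be checked against the reference before any real use. The function names are hypothetical, and the input is the first 32 residues of SEQ ID NO:2.

```python
# Hopp-Woods hydrophilicity values (assumed; verify against the published scale).
HOPP_WOODS = {
    "R": 3.0, "D": 3.0, "E": 3.0, "K": 3.0, "S": 0.3, "N": 0.2, "Q": 0.2,
    "G": 0.0, "P": 0.0, "T": -0.4, "A": -0.5, "H": -0.5, "C": -1.0,
    "M": -1.3, "V": -1.5, "I": -1.8, "L": -1.8, "Y": -2.3, "F": -2.5,
    "W": -3.4,
}


def hydrophilicity_profile(seq, window=6):
    """Return (start_position, mean_hydrophilicity) for every window.

    Positions are 1-based. Residues outside the 20 standard amino acids
    (for example X) raise KeyError rather than being silently scored.
    """
    scores = [HOPP_WOODS[aa] for aa in seq.upper()]
    return [
        (i + 1, round(sum(scores[i:i + window]) / window, 3))
        for i in range(len(scores) - window + 1)
    ]


def top_windows(seq, window=6, n=3):
    """Return the n most hydrophilic windows as (start, peptide, score)."""
    ranked = sorted(hydrophilicity_profile(seq, window),
                    key=lambda item: item[1], reverse=True)
    return [(start, seq[start - 1:start - 1 + window], score)
            for start, score in ranked[:n]]


# First 32 residues of SEQ ID NO:2, written in one-letter code.
fragment = "LETQLGPNWDPNFSSGGRTFDDFHRYLLVGIQ"
for start, peptide, score in top_windows(fragment):
    print(start, peptide, score)
```

Windows with the highest mean hydrophilicity are candidate surface-exposed regions; such a screen only prioritizes peptides for the serological testing described in this Example.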

Once peptides (15-20 amino acids long) are identified using these
techniques, individual peptides can be synthesized using automated peptide synthesis
equipment (available from manufacturers such as Perkin Elmer/Applied Biosystems
Division, Foster City, CA) and techniques such as Merrifield synthesis. Following
synthesis, the peptides can be used to screen sera harvested from either normal or breast
cancer patients to determine whether patients with breast cancer possess antibodies
reactive with the peptides. Presence of such antibodies in breast cancer patients would
confirm the immunogenicity of the specific B-cell epitope in question. The peptides
can also be tested for their ability to generate a serologic or humoral immune response
in animals (mice, rats, rabbits, chimps, etc.) following immunization in vivo. Generation
of a peptide-specific antiserum following such immunization further confirms the
immunogenicity of the specific B-cell epitope in question.

To identify T-cell epitopes, the B18Ag1 sequence can be screened using
different computer algorithms which are useful in identifying 8-10 amino acid motifs
within the B18Ag1 sequence which are capable of binding to HLA Class I MHC
molecules (see, e.g., Rammensee et al., Immunogenetics 41:178-228 (1995)).
Following synthesis, such peptides can be tested for their ability to bind to class I MHC
using standard binding assays (e.g., Sette et al., J. Immunol. 153:5586-92 (1994)) and,
more importantly, can be tested for their ability to generate antigen-reactive cytotoxic
T-cells following in vitro stimulation of patient or normal peripheral mononuclear cells
using, for example, the methods of Bakker et al., Cancer Res. 55:5330-34 (1995);
Visseren et al., J. Immunol. 154:3991-98 (1995); Kawakami et al., J. Immunol.
154:3961-68 (1995); and Kast et al., J. Immunol. 152:3904-12 (1994). Successful
in vitro generation of T-cells capable of killing autologous (bearing the same Class I
MHC molecules) tumor cells following in vitro peptide stimulation further confirms the
immunogenicity of the B18Ag1 antigen. Furthermore, such peptides may be used to
generate murine peptide- and B18Ag1-reactive cytotoxic T-cells following in vivo
immunization in mice rendered transgenic for expression of a particular human MHC
Class I haplotype (Vitiello et al., J. Exp. Med. 173:1007-15 (1991)).
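
For purposes of illustration only, a motif screen of the kind described above may be sketched as a scan of all nine-residue peptides for anchor residues. The anchor rule used here (Leu or Met at position 2, and Val, Leu or Ile at the C-terminal position) is a simplified, commonly cited HLA-A2.1 motif and is an assumption of this sketch; it is not the algorithm actually used to generate the predicted epitopes listed in this Example, which evidently applied looser criteria. The function name and parameters are hypothetical.

```python
def scan_nonamers(protein, p2_anchors="LM", c_term_anchors="VLI", length=9):
    """Return (start_position, peptide) for peptides matching both anchors.

    start_position is 1-based within the supplied sequence. The anchor
    sets are a simplified HLA-A2.1-style rule (assumption, see lead-in).
    """
    hits = []
    for start in range(len(protein) - length + 1):
        peptide = protein[start:start + length]
        if peptide[1] in p2_anchors and peptide[-1] in c_term_anchors:
            hits.append((start + 1, peptide))
    return hits


# Residues 23-54 of SEQ ID NO:2, written in one-letter code.
fragment = "FHRYLLVGIQGAAQKPINLSKAIEVVQGHDES"
hits = scan_nonamers(fragment)
for position, peptide in hits:
    print(position, peptide)
```

Peptides flagged by such a scan are only candidates; binding and cytotoxic T-cell assays of the kind cited above remain necessary to confirm them.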

A representative list of predicted B18Ag1 B-cell and T-cell epitopes,
broken down according to predicted HLA Class I MHC binding antigen, is shown
below:

Predicted Th Motifs (B-cell epitopes) (SEQ ID NOS.: 131-133)
SSGGRTFDDFHRYLLVGI
QGAAQKPINLSKXIEVVQGHDE
SPGVFLEHLQEAYRIYTPFDLSA

Predicted HLA A2.1 Motifs (T-cell epitopes) (SEQ ID NOS.: 134-140)
YLLVGIQGA
GAAQKPINL
NLSKXIEVV
EVVQGHDES
HLQEAYRIY
NLAFVAQAA
FVAQAAPDS

EXAMPLE 5 

Characterization of Breast Tumor Genes Discovered by
Differential Display PCR

The specificity and sensitivity of the breast tumor genes discovered by
differential display PCR were determined using RT-PCR. This procedure enabled the
rapid evaluation of breast tumor gene mRNA expression semiquantitatively without
using large amounts of RNA. Using gene-specific primers, mRNA expression levels in
a variety of tissues were examined, including 8 breast tumors, 5 normal breasts, 2
prostate tumors, 2 colon tumors, 1 lung tumor, and 14 other normal adult human tissues
including normal prostate, colon, kidney, liver, lung, ovary, pancreas, skeletal muscle,
skin, stomach and testes.

To ensure the semiquantitative nature of the RT-PCR, β-actin was used
as an internal control for each of the tissues examined. Serial dilutions of the first strand
cDNAs were prepared and RT-PCR assays performed using β-actin specific primers. A
dilution was then selected that enabled the linear range amplification of β-actin
template, and which was sensitive enough to reflect the difference in the initial copy
number. Using this condition, the β-actin levels were determined for each reverse
transcription reaction from each tissue. DNA contamination was minimized by DNase
treatment and by assuring a negative result when using first strand cDNA that was
prepared without adding reverse transcriptase.
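
As a non-limiting sketch, the use of β-actin as an internal control may be expressed as a simple normalization of each gene signal to the β-actin signal measured for the same reverse transcription reaction. The band intensities below are hypothetical placeholders chosen only to make the arithmetic visible, not measurements from this Example, and the function name is likewise hypothetical.

```python
def normalize_to_actin(gene_signal, actin_signal):
    """Express a gene band intensity relative to the beta-actin band."""
    if actin_signal <= 0:
        raise ValueError("beta-actin signal must be positive")
    return gene_signal / actin_signal


# Hypothetical densitometry readings (arbitrary units): (gene, beta-actin).
band_intensities = {
    "breast tumor": (850.0, 1000.0),
    "normal breast": (90.0, 900.0),
}

normalized = {
    tissue: normalize_to_actin(gene, actin)
    for tissue, (gene, actin) in band_intensities.items()
}

# Ratio of normalized tumor signal to normalized normal-tissue signal.
fold_change = normalized["breast tumor"] / normalized["normal breast"]

for tissue, value in normalized.items():
    print(f"{tissue}: {value:.2f} (relative to beta-actin)")
print(f"tumor / normal: {fold_change:.1f}-fold")
```

Such a ratio is meaningful only when both amplifications lie within the linear range, which is why the dilution is selected as described above.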

Using gene-specific primers, the mRNA expression levels were
determined in a variety of tissues. To date, 38 genes have been successfully examined
by RT-PCR, five of which exhibit good specificity and sensitivity for breast tumors
(B15AG-1, B31GA1b, B38GA2a, B11A1a and B18AG1a). Figures 21A and 21B
depict the results for three of these genes: B15AG-1 (SEQ ID NO:27), B31GA1b (SEQ
ID NO:148) and B38GA2a (SEQ ID NO:157). Table I summarizes the expression
level of all the genes tested in normal breast tissue and breast tumors, and also in other
tissues.

TABLE I 

Percentage of Breast Cancer Antigens that are Expressed in Various Tissues 



Breast Tissues    Over-expressed in Breast Tumors                 84%

                  Equally Expressed in Normals and Tumor          16%

Other Tissues     Over-expressed in Breast Tumors but
                  not in any Normal Tissues                        2%

                  Over-expressed in Breast Tumors but
                  Expressed in Some Normal Tissues                30%

                  Over-expressed in Breast Tumors but
                  Equally Expressed in All Other Tissues          61%



From the foregoing, it will be appreciated that, although specific
embodiments of the invention have been described herein for the purpose of illustration,
various modifications may be made without deviating from the spirit and scope of the
invention.

SEQUENCE LISTING

(1) GENERAL INFORMATION: 

(i) APPLICANTS: Frudakis, Tony N. 

Smith, John M. 
Reed, Steven G. 

(ii) TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE
TREATMENT AND DIAGNOSIS OF BREAST CANCER

(iii) NUMBER OF SEQUENCES: 297 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY LLP 

(B) STREET: 6300 Columbia Center, 701 Fifth Avenue 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 04-APR-1997

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Maki, David J.

(B) REGISTRATION NUMBER: 31,392

(C) REFERENCE/DOCKET NUMBER: 210121.419C2 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 622-4900 

(B) TELEFAX: (206) 682-6031 



(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 363 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..363

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

TTA GAG ACC CAA TTG GGA CCT AAT TGG GAC CCA AAT TTC TCA AGT GGA   48
Leu Glu Thr Gln Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly
1               5                   10                  15

GGG AGA ACT TTT GAC GAT TTC CAC CGG TAT CTC CTC GTG GGT ATT CAG   96
Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly Ile Gln
            20                  25                  30

GGA GCT GCC CAG AAA CCT ATA AAC TTG TCT AAG GCG ATT GAA GTC GTC  144
Gly Ala Ala Gln Lys Pro Ile Asn Leu Ser Lys Ala Ile Glu Val Val
        35                  40                  45

CAG GGG CAT GAT GAG TCA CCA GGA GTG TTT TTA GAG CAC CTC CAG GAG  192
Gln Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gln Glu
    50                  55                  60

GCT TAT CGG ATT TAC ACC CCT TTT GAC CTG GCA GCC CCC GAA AAT AGC  240
Ala Tyr Arg Ile Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser
65                  70                  75                  80

CAT GCT CTT AAT TTG GCA TTT GTG GCT CAG GCA GCC CCA GAT AGT AAA  288
His Ala Leu Asn Leu Ala Phe Val Ala Gln Ala Ala Pro Asp Ser Lys
                85                  90                  95

AGG AAA CTC CAA AAA CTA GAG GGA TTT TGC TGG AAT GAA TAC CAG TCA  336
Arg Lys Leu Gln Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gln Ser
            100                 105                 110

GCT TTT AGA GAT AGC CTA AAA GGT TTT                              363
Ala Phe Arg Asp Ser Leu Lys Gly Phe
        115                 120
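
As an illustrative aid only, the correspondence between the coding sequence of SEQ ID NO:1 and the amino acid sequence printed beneath it may be checked by conceptual translation with the standard genetic code. The sketch below builds the codon table from the conventional TCAG ordering; the names `CODON_TABLE` and `translate` are hypothetical, and only the first sixteen codons of SEQ ID NO:1 are used.

```python
from itertools import product

# Standard genetic code in the conventional TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a coding sequence in frame 1 into one-letter amino acids.

    Whitespace is ignored. Codons containing ambiguity codes such as N
    raise KeyError rather than being guessed.
    """
    cleaned = "".join(dna.split()).upper()
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[cleaned[i:i + 3]]
                   for i in range(0, len(cleaned), 3))


# First 48 nucleotides (codons 1-16) of SEQ ID NO:1.
first_row = "TTA GAG ACC CAA TTG GGA CCT AAT TGG GAC CCA AAT TTC TCA AGT GGA"
protein = translate(first_row)
print(protein)
```

The printed string may be compared, residue by residue, with the first line of the amino acid sequence shown for SEQ ID NO:1.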



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Leu Glu Thr Gln Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly
1               5                   10                  15

Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly Ile Gln
            20                  25                  30

Gly Ala Ala Gln Lys Pro Ile Asn Leu Ser Lys Ala Ile Glu Val Val
        35                  40                  45

Gln Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gln Glu
    50                  55                  60

Ala Tyr Arg Ile Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser
65                  70                  75                  80

His Ala Leu Asn Leu Ala Phe Val Ala Gln Ala Ala Pro Asp Ser Lys
                85                  90                  95

Arg Lys Leu Gln Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gln Ser
            100                 105                 110

Ala Phe Arg Asp Ser Leu Lys Gly Phe
        115                 120

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



TCTTAGAATC TTCATACCCC GAACTCTTGG GAAT^CTTTA ATCAGTCACC TACAGTCTAC 60
CACCCATTTA GGAGGAGCAA AGCTACCTCA GCTCCTCCGG AGCCGTTTTA AGATCCCCCA 120
TCTTCAAAGC CTAACAGATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC AGGTAAATGC 180
CAAAAAAGGT CCTAAACCCA GCCCAGGCCA CCGTCTCCAA GAAAACTCAC CAGGAGAAAA 240
GTGGGAAATT GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAT ACCTTCTAGT 300
ACTGGTAGAC ACCTTCTCTG GATGGACTGA AGCATTTGCT ACCAAAAACG AAACTGTCAA 360
TATGGTAGTT AAGTTTTTAC TCAATGAAAT CATCCCTCGA CGTGGGCTGC CTGTTGCCAT 420
AGGGTCTGAT AATGGAACGG CCTTCGCCTT GTCTATAGTT TAATCAGTCA GTAAGGCGTT 480
AAACATTCAA TGGAAGCTCC ATTGTGCCTA TCGACCCAGA GCTCTGGGCA AGTAGAACGC 540
ATGAACTGCA CCCTAAAAAA ACACTCTTAC AAAATTAATC TTAAAAACCG GTGTTAATTG 600
TGTTAGTCTC CTTCCCTTAG CCCTACTTAG AGTTAAGGTG CACCCCTTAC TGGGCTGGGT 660
TCTTTACCTT TTGAAATCAT NTTTNGGAAG GGGCTGCCTA TCTTTNCTTA ACTAAAAAAN 720
GCCCATTTGG CAAAAATTTC NCAACTAATT TNTACGTNCC TACGTCTCCC CAACAGGTAN 780
AAAAATCTNC TGCCCTTTTC AAGGAACCAT CCCATCCATT CCTNAACAAA AGGCCTGCCN 840
TTCTTCCCCC AGTTAACTNT TTTTTNTTAA AATTCCCAAA AAANGAACCN CCTGCTGGAA 900
AAACNCCCCC CTCCAANCCC CGGCCNAAGN GGAAGGTTCC CTTGAATCCC NCCCCCNCNA 960
ANGGCCCGGA ACCNTTAAAN TNGTTCCNGG GGGTNNGGCC TAAAAGNCCN ATTTGGTAAA 1020
CCTAWAAATT TTTTCTTTTN TAAAAACCAC NKTTTNNTTT TTCTTAAACA AAACCCTNTT 1080
TNTAGNANCN TATTTCCCNC C 1101



(2) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1087 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

TCTAGAGCTG CGCCTGGATC CCGCCACAGT GAGGAGACCT GAAGACCAGA GAAAACACAG 60
CAAGTAGGCC CTTTAAACTA CTCACCTGTG TTGTCTTCTA ATTTATTCTG TTTTATTTTG 120
TTTCCATCAT TTTAAGGGGT TAAAATCATC TTGTTCAGAC CTCAGCATAT AAAATGACCC 180
ATCTGTAGAC CTCAGGCTCC AACCATACCC CAAGAGTTGT CTGGTTTTGT TTAAATTACT 240
GCCAGGTTTC AGCTGCAGAT ATCCCTGGAA GGAATATTCC AGATTCCCTG AGTAGTTTCC 300
AGGTTAAAAT CCTATAGGCT TCTTCTGTTT TGAGGAAGAG TTCCTGTCAG AGAAAAACAT 360
GATTTTGGAT TTTTAACTTT AATGCTTGTG AAACGCTATA AAAAAAATTT TCTACCCCTA 420
GCTTTAAAGT ACTGTTAGTG AGAAATTAAA ATTCCTTCAG GAGGATTAAA CTGCCATTTC 480
AGTTACCCTA ATTCCAAATG TTTTGGTGGT TAGAATCTTC TTTAATGTTC TTGAAGAAGT 540
GTTTTATATT TTCCCATCNA GATAAATTCT CTCNCNCCTT NNTTTTNTNT CTNNTTTTTT 600
AAAACGGANT CTTGCTCCGT TGTCCANGCT GGGAATTTTN TTTTGGCCAA TCTCCGCTNC 660
CTTGCAANAA TNCTGCNTCC CAAAATTACC NCCTTTTTCC CACCTCCACC CCNNGGAATT 720
ACCTGGAATT ANAGGCCCCC NCCCCCCCCC CGGCTAATTT GTTTTTGTTT TTAGTAAAAA 780
ACGGGTTTCC TGTTTTAGTT AGGATGGCCC ANNTCTGACC CCNTNATCNT CCCCCTCNGC 840
CCTCNAATNT TNGGNWTANG GCTTACCCCC CCCNGNNGTT TTTCCTCCAT TNAAATTTTC 900
TNTGGANTCT TGAATNNCGG GTTTTCCCTT TTAAACCNAT TTTTTTTTTN NNNCCCCCAN 960
TTTTNCCTCC CCCNTNTNTA ANGGGGGTTT CCCAANCCGG GTCCNCCCCC ANGTCCCCAA 1020
TTTTTCTCCC CCCCCCTCTT TTTTCTTTNC CCCAAAANTC CTATCTTTTC CTNNAAATAT 1080
CNANTNT 1087



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

TCTAGACCAA GAAATGGGAG GATTTTAGAG TGACTGATGA TTTCTCTATC ATCTGCAGTT 60 

AGTAAACATT CTCCACAGTT TATGCAAAAA GTAACAAAAC CACTGCAGAT GACAAACACT 12 0 

AGGTAACACA CATACTATCT CCCAAATACC TACCCACAAG CTCAACAATT TTAAACTGTT 18 0 

AGGATCACTG GCTCTAATCA CCATGACATG AGGTCACCAC CAAACCATCA AGCGCTAAAC 24 0 

AGACAGAATG TTTCCACTCC TGATCCACTG TGTGGGAAGA AGCACCGAAC TTACCCACTG 3 00 

GGGGGCCTGC NTCANAANAA AAGCCCATGC CCCCGGGTNT NCCTTTNAAC CGGAACGAAT 3 60 

NAACCCACCA TCCCCACANC TCCTCTGTTC NTGGGCCCTG CATCTTGTGG CCtCNTNTNC 42 0 

TTTNGGGGAN ACNTGGGGAA GGTACCCCAT TTCNTTGACC CCNCNANAAA ACCCCNGTGG 4 80 

CCCTTTGCCC TGATTCNCNT GGGCCTTTTC TCTTTTCCCT TTTGGGTTGT TTAAATTCCC 54 0 

AATGTCCCCN GAACCCTCTC CNTNCTGCCC AAAACCTACC TAAATTNCTC NCTANGNNTT 600 

TTCTTGGTGT TNCTTTTCAA AGGTNACCTT NCCTGTTCAN NCCCNACNAA AATTTNTTCC 6 60 

NTATNNTGGN CCCNNAAAAA NNNATCNNCC CNAATTGCCC GAATTGGTTN GGTTTTTCCT 72 0 

NCTGGGGGAA ACCCTTTAAA TTTCCCCCTT GGCCGGCCCC CCTTTTTTCC CCCCTTTNGA 7 80 

AGGCAGGNGG TTCTTCCCGA ACTTCCAATT NCAACAGCCN TGCCCATTGN TGAAACCCTT 84 0 

TTCCTAAAAT TAAAAAATAN CCGGTTNNGG NNGGCCTCTT TCCCCTCCNG GNGGGNNGNG 90 0 

AAANTCCTTA CCCCNAAAAA GGTTGCTTAG CCCCCNGTCC CCACTCCCCC NGGAAAAATN 96 0 

AACCTTTTCN AAAAAAGGAA TATAANTTTN CCACTCCTTN GTTCTCTTCC 1010 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 950 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
GcSStSa GCTCTAATAC GACTCACTAT AGGGCGTCGA CTCGATCTCA

^ = Si = = == 

5s s= sEss :™ ~: sss 

= = i™ i= = 

NHCCCNTNCC CTNNTNCNCT NCTNNTACCN TNcScScN JJSccSS "^^CTC 

(2) INFORMATION FOR SEQ ID NO : 7 : 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:

=i =S = ™= Ssi Si- 

SoS"- ™-c s?™ 

sss iiE = = E~ ssss 
= = = s~ = .= 

liAACTTTTT ATGANACAAA AACTTTGTTC NCTTTTCCTP r-ra»r-om^m^ ^v-iiit-t. 
CCTATTGGCC TGCCCATCCC CTrrrr^^AM CGAACCTCTC CCCCTATTAN 

TCCAAAACNT StccJotJS G^rSrr^JJi^ GGTGAAAANA TGTTCNT/^ TNCGAGGGAA 
AACNTGTCCC GGn?cc?^CN ^JcCcScCCC ^JJcSgIS SISf TTTCCTCCCC 

cSgSg'SJ? SI^^^^'^^ ^™ 

StCCCC ^^^^^^^^^-^ CCTNCACCCC AATTTGGAAN CCNGTTTTT^ J^SJ^CCN 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1177 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:

NCCNTTTAGA TGTTGACAAN NTAAACAAGC NGCTCAGGCA GCTGAAAAAA GCCACTGATA 6 0 

AAGCATCCTG GAGTATCAGA GTTTACTGTT AGATCAGCCT CATTTGACTT CCCCTCCCAC 12 0 

ATGGTGTTTA AATCCAGCTA CACTACTTCC TGACTCAAAC TCCACTATTC CTGTTCATGA 180 

CTGTCAGGAA CTGTTGGAAA CTACTGAAAC TGGCCGACCT GATCTTCAAA ATGTGCCCCT 24 0 

AGGAAAGGTG GATGCCACCG TGTTCACAGA CAGTACCNCC TTCCTCGAGA AGGGACTACG 3 00 

AGGGGCCGGT GCANCTGTTA CCAAGGAGAC TNATGTGTTG TGGGCTCAGG CTTTACCANC 3 60 

AAACACCTCA NCNCNNAAGG CTG7UVTTGAT CGCCCTCACT CAGGCTCTCG GATGGGGTAA 42 0 

GGGATATTAA CGTTAACACT GACAGCAGGT ACGCCTTTGC TACTGTGCAT GTACGTGGAG 480 

CCATCTACCA GGAGCGTGGG CTACTCACTC GGCAGGTGGC TGTNATCCAC TGTAAANGGA 54 0 

CATCAAAAGG AAAACNNGGC TGTTGCCCGT GGTAACCANA AANCTGATCN NCAGCTCNAA 60 0 

GATGCTGTGT TGACTTTCAC TCNCNCCTCT TAAACTTGCT GCCCACANTC TCCTTTCCCA 660 

ACCAGATCTG CCTGACAATC CCCATACTCA AAAAAAAAAN AANACTGGCC CCGAACCCNA 72 0 

ACCAATAAAA ACGGGGANGG TNGGTNGANC NNCCTGACCC AAAAATAATG GATCCCCCGG 7 80 

GCTGCAGGAA TTCAATTCAN CCTTATCNAT ACCCCCAACN NGGNGGGGGG GGCCNGTNCC 84 0 

CATTNCCCCT NTATTNATTC TTTNNCCCCC CCCCCGGCNT CCTTTTTNAA CTCGTGAAAG 900 

GGAAAACCTG NCTTACCAAN TTATCNCCTG GACCNTCCCC TTCCNCGGTN GNTTANAAAA 960 

AAAAGCCCNC ANTCCCNTCC NAAATTTGCA CNGAAAGGNA AGGAATTTAA CCTTTATTTT 1020

TTNNTCCTTT ANTTTGTNNN CCCCCTTTTA CCCAGGCGAA CNGCCATCNT TTAANAAAAA 1080

AAANAGAANG TTTATTTTTC CTTNGAACCA TCCCAATANA AANCACCCGC NGGGGAACGG 1140

GGNGGNAGGC CNCTCACCCC CTTTNTGTNG GNGGGNC 1177

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

NCCNNTTNNT GATGTTGTCT TTTTGGCCTC TCTTTGGATA CTTTCCCTCT CTTCAGAGGT 60 

GAAAAGGGTC AAAAGGAGCT GTTGACAGTC ATCCCAGGTG GGCCAATGTG TCCAGAGTAC 12 0 

AGACTCCATC AGTGAGGTCA AAGCCTGGGG CTTTTCAGAG AAGGGAGGAT TATGGGTTTT 180 

CCAATTATAC AAGTCAGAAG TAGAAAGAAG GGACATAAAC CAGGAAGGGG GTGGAGCACT 24 0 

CATCACCCAG AGGGACTTGT GCCTCTCTCA GTGGTAGTAG AGGGGCTACT TCCTCCCACC 3 00 

ACGGTTGCAA CCAAGAGGCA ATGGGTGATG AGCCTACAGG GGACATANCC GAGGAGACAT 3 60 

GGGATGACCC TAAGGGAGTA GGCTGGTTTT AAGGCGGTGG GACTGGGTGA GGGAAACTCT 42 0 

CCTCTTCTTC AGAGAGAAGC AGTACAGGGC GAGCTGAACC GGCTGAAGGT CGAGGCGAAA 480 

ACACGGTCTG GCTCAGGAAG ACCTTGGAAG TAAAATTATG AATGGTGCAT GAATGGAGCC 54 0 

ATGGAAGGGG TGCTCCTGAC CAAACTCAGC CATTGATCAA TGTTAGGGAA ACTGATCAGG 6 00 

GAAGCCGGGA ATTTCATTAA CAACCCGCCA CACAGCTTGA ACATTGTGAG GTTCAGTGAC 660 

CCTTCAAGGG GCCACTCCAC TCCAACTTTG GCCATTCTAC TTTGCNAAAT TTCCAAAACT 72 0 

TCCTTTTTTA AGGCCGAATC CNTANTCCCT NAAAAACNAA AAAAAATCTG CNCCTATTCT 7 80 

GGAAAAGGCC CANCCCTTAC CAGGCTGGAA GAAATTTTNC CTTTTTTTTT TTTTTGAAGG 84 0 

CNTTTNTTAA ATTGAACCTN AATTCNCCCC CCCAAAAAAA AACCCNCCNG GGGGGCGGAT 90 0 

TTCCAAAAAC NAATTCCCTT ACCAAAAAAC AAAAACCCNC CCTTNTTCCC TTCCNCCCTN 96 0 

TTCTTTTAAT TAGGGAGAGA TNAAGCCCCC CAATTTCCNG GNCTNGATNN GTTTCCCCCC 102 0 

CCCCCATTTT CCNAAACTTT TTCCCANCNA GGAANCCNCC CTTTTTTTNG GTCNGATTNA 10 8 0 

NCAACCTTCC AAACCATTTT TCCNNAAAAA NTTTGNTNGG NGGGAAAAAN ACCTNNTTTT 114 0 
ATAGAN

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CTTCATTGGG TACGGGCCCC CTCGAGGTCG ACGGTATCGA TAAGCTTGAT ATCGAATTCC 60 

TGCAGCCCGG GGGATCCACT AGTTCTAGAG TCAGGAAGAA CCACCAACCT TCCTGATTTT 12 0 

TATTGGCTCT GAGTTCTGAG GCCAGTTTTC TTCTTCTGTT GAGTATGCGQ GATTGTCAGG 180 

CAGATCTGGC TGTGGAAAGG AGACTGTGGG CAGCAAGTTT AGAGGCGTGA CTGAAAGTC/^ 240 

CACTGCATCT TGAGCTGCTG AATCAGCTTT CTGGTTACCA CGGGCAACAG CCGTGTTTTC 300

CTTTTGATGT CCTTTACAGT GGATTACAGC CACCTGCTGA GGTGAGTAGC CCACGCTCCT 3 60 

GGTAGATGGC TCCACGTACA TGCACAGTAG CAAAGGCGTA CCTGCTGTCA GTGTTAACGT 42 0 

TAATATCCTT ACCCCATCGG AGAGCCTGAG TGAGGGCGAT CAATTCAGCC CTTTTGTGCT 4 80 

GAGGTGTTTG CTGGTTAAGC CCTGAACCCA CAACACATCT GTCTCCATGG TAACAGCTGC 54 0 
ACCGG 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TCTCCTAGGC TGGGCACAGT GGCTCATACC TGTAATCCTG ACCGTTTCAG AGGCTCAGGT 
GGGGGGATCG CTTGAGCCCA AGATTTCAAG ACTAGTCTGG GTAACATAGT GAGACCCTAT 
CTCTACGAAA AAATAAAAAA ATGAGCCTGG TGTAGTGGCA CACACCAGCT GAGGAGGGAG 
AATCGAGCCT AGGAGA 

(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 



TCTCCTAGGC TTGGGGGCTC TGACTAGAAA TTCAAGGAAC CTGGGATTCA AGTCCAACTG 
TGACACCAAC TTACACTGTG GNCTCCAATA AACTGCTTCT TTCCTATTCC CTCTCTATTA 
AATAAAATAA GGAAAACGAT GTCTGTGTAT AGCCAAGTCA GNTATCCTAA AAGGAGATAC 
TAAGTGACAT TAAATATCAG AATGTAAAAC CTGGGAACCA GGTTCCCAGC CTGGGATTAA 
ACTGACAGCA AGAAGACTGA ACAGTACTAC TGTGAAAAGC CCGAAGNGGC AATATGTTCA 300

CTCTACCGTT GAAGGATGGC TGGGAGAATG AATGCTCTGT CCCCCAGTCC CAAGCTCACT 3 60 

TACTATACCT CCTTTATAGC CTAGGAGA 3 8 8 

(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

TAGTAGTTGC CTATAATCAT GTTTCTCATT ATTTTCACAT TTTATTAACC AATTTCTGTT 6 0 

TACCCTGAAA AATATGAGGG AAATATATGA AACAGGGAGG CAATGTTCAG ATAATTGATC 12 0 

ACAAGATATG ATTTCTACAT CAGATGCTCT TTCCTTTCCT GTTTATTTCC TTTTTATTTC 18 0 

GGTTGTGGGG TCGAATGTAA TAGCTTTGTT TCAAGAGAGA GTTTTGGCAG TTTCTGTAGC 24 0 

TTCTGACACT GCTCATGTCT CCAGGCATCT ATTTGCACTT TAGGAGGTGT CGTGGGAGAC 3 00 

TGAGAGGTCT ATTTTTTCCA TATTTGGGCA ACTACTA 337 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

TAGTAGTTGC CATACAGTGC CTTTCCATTT ATTTAACCCC CACCTGAACG GCATAAACTG 6 0 

AGTGTTCAGC TGGTGTTTTT TACTGTAAAC AATAAGGAGA CTTTGCTCTT CATTTAAACC 12 0 

AAAATCATAT TTCATATTTT ACGCTCGAGG GTTTTTACCG GTTCCTTTTT ACACTCCTTA 18 0 

AAACAGTTTT TAAGTCGTTT GGAACAAGAT ATTTTTTCTT TCCTGGCAGC TTTTAACATT 24 0 

ATAGCAAATT TGTGTCTGGG GGACTGCTGG TCACTGTTTC TCACAGTTGC AAATCAAGGC 3 00 

ATTTGCAACC AAGAAAAAAA AATTTTTTTG TTTTATTTGA AACTGGACCG GATAAACGGT 3 60 

GTTTGGAGCG GCTGCTGTAT ATAGTTTTAA ATGGTTTATT GCACCTCCTT AAGTTGCACT 42 0 

TATGTGGGGG GGGGNTTTTG NATAGAAAGT NTTTANTCAC ANAGTCACAG GGACTTTTNT 4 80 

CTTTTGGNNA CTGAGCTAAA AAGGGCTGNT TTTCGGGTGG GGGCAGATGA AGGCTCACAG 54 0 

GAGGCCTTTC TCTTAGAGGG GGGAACTNCT A 5 71 

(2) INFORMATION FOR SEQ ID NO : 15 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TATATATTTA ATAACTTAAA TATATTTTGA TCACCCACTG GGGTGATAAG ACAATAGATA 60

TAAAAGTATT TCCAAAAAGC ATAAAACCAA AGTATCATAC CAAACCAAAT TCATACTGCT 12 0 

TCCCCCACCC GCACTGAAAC TTCACCTTCT AACTGTCTAC CTAACCAAAT TCTACCCTTC 18 0 

AAGTCTTTGG TGCGTGCTCA CTACTCTTTT TTTTTTTTTT TTTNTTTTGG AGATGGAGTC 240

TGGCTGTGCA GCCCAGGGGT GGAGTACAAT GGCACAACCT CAGCTCACTG NAACCTCCGC 3 00 

CTCCCAGGTT CATGAGATTC TCCTGNTTCA GCCTTCCCAG TAGCTGGGAC TACAGGTGTG 360 

CATCACCATG CCTGGNTAAT CTTTTTTNGT TTTNGGGTAG AGATGGGGGT TTTACATGTT 42 0 

GGCCAGGNTG GTNTCGAACT CCTGACCTCA AGTGATCCAC CCACCTCAGG CTCCCAAAGT 4 80 

GCTAGGATTA CAGACATGAG CCACTGNGCC CAGNCCTGGT GCATGCTCAC TTCTCTAGGC 54 0 

AACTACTA 548

(2) INFORMATION FOR SEQ ID NO : 16 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TTCCGTTATG CACATGCAGA ATATTCTATC GGTACTTCAG CTATTACTCA TTTTGATGGC 6 0 

GCAATCCGAG CCTATCCTCA AGATGAGTAT TTAGAAAGAA TTGATTTAGC GATAGACCAA 12 0 

GCTGGTAAGC ACTCTGACTA CACGAAATTG TTCAGATGTG ATGGATTTAT GACAGTTGAT ' 180 

CTTTGGAAGA GATTATTAAG TGATTATTTT AAAGGGAATC CATTAATTCC AGAATATCTT 24 0 

GGTTTAGCTC AAGATGATAT AGAAATAGAA CAGAAAGAGA CTACAAATGA AGATGTATCA 3 00 

CCAACTGATA TTGAAGAGCC TATAGTAGAA AATGAATTAG CTGCATTTAT TAGCCTTACA 3 60 

CATAGCGATT TTCCTGATGA ATCTTATATT CAGCCATCGA CATAGCATTA CCTGATGGGC 42 0 

AACCTTACGA ATAATAGAAA CTGGGTGCGG GGCTATTGAT GAATTCATCC NCAGTAAATT 4 80 

TGGATATNAC AAAATATAAC TCGATTGCAT TTGGATGATG GAATACTAAA TCTGGCAAAA 54 0 

GTAACTTTGG AGCTACTAGT AACCTCTCTT TTTGAGATGC AAAATTTTCT TTTAGGGTTT 60 0 

CTTATTCTCT ACTTTACGGA TATTGGAGCA TAACGGGA 63 8 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ACTGATGGAT GTCGCCGGAG GCGAGGGGCC TTATCTGATG CTCGGCTGCC TGTTCGTGAT 60 

GTGCGCGGCG ATTGGGCTGT TTATCTCAAA CACCGCCACG GCGGTGCTGA TGGCGCCTAT 12 0 

TGCCTTAGCG GCGGCGAAGT CAATGGGCGT CTCACCCTAT CCTTTTGCCA TGGTGGTGGC 18 0 

GATGGCGGCT TCGGCGGCGT TTATGACCCC GGTCTCCTCG CCGGTTAACA CCCTGGTGCT 24 0 

TGGCCCTGGC AAGTACTCAT TTAGCGATTT TGTCAAAATA GGCGTG 2 86 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 62 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



TCGGTCATAG CAGCCCCTTC TTCTCAATTT CATCTGTCAC TACCCTGGTG TAGTATCTCA 60 

TAGCCTTACA TTTTTATAGC CTCCTCCCTG GTCTGTCTTT TGATTTTCCT GCCTGTAATC 12 0 

CATATCACAC ATAACTGCAA GTAAACATTT CTAAAGTGTG GTTATGCTCA TGTCACTCCT 18 0 

GTGNCAAGAA ATAGTTTCCA TTACCGTCTT AATAAAATTC GGATTTGTTC TTTNCTATTN 24 0 

TCACTCTTCA CCTATGACCG AA 262 

(2) INFORMATION FOR SEQ ID NO : 19 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

TCGGTCATAG CAAAGCCAGT GGTTTGAGCT CTCTACTGTG TAAACTCCTA AACCAAGGCC 6 0 

ATTTATGATA AATGGTGGCA GGATTTTTAT TATAAACATG TACCCATGCA AATTTCCTAT 12 0 

AACTCTGAGA TATATTCTTC TACATTTAAA CAATAAAAAT AATCTATTTT TAAAAGCCTA 18 0 

ATTTGCGTAG TTAGGTAAGA GTGTTTAATG AGAGGGTATA AGGTATAAAT CACCAGTCAA 24 0 

CGTTTCTCTG CCTATGACCG A 261 

(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 0 : 

TACAACGAGG CGACGTCGGT AAAATCGGAC ATGAAGCCAC CGCTGGTCTT TTCGTCCGAG 6 0 

CGATAGGCGC CGGCCAGCCA GCGGAACGGT TGCCCGGATG GCGAAGCGAG CCGGAGTTCT 12 0 

TCGGACTGAG TATGAATCTT GTTGTGAAAA TACTCGCCGC CTTCGTTCGA CGACGTCGCG 18 0 

TCGAAATCTT CGANCTCCTT ACGATCGAAG TCTTCGTGGG CGACGATCGC GGTCAGTTCC 24 0 

GCCCCACCGA AATCATGGTT GAGCCGGATG CTGNCCCCGA AGNCCTCGTT TGTN 2 94 

(2) INFORMATION FOR SEQ ID NO : 2 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 08 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

TTGGTAAAGG GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 
ATCAATGAAT AGGAAAATTC CCAAAGAGGG AATGTCCTGT TGCTCGCCAG TTTTTNTGTT 
GTTCTCATGG ANAAGGCAAN GAGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 
CAACTAGTCG NCTTGCNANG ATCTTCAT 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

NCCNTTGAGC TGAGTGATTG AGATNTGTAA TGGTTGTAAG GGTGATTCAG GCGGATTAGG 
GTGGCGGGTC ACCCGGCAGT GGGTCTCCCG ACAGGCCAGC AGGATTTGGG GCAGGTACGG 
NGTGCGCATC GCTCGACTAT ATGCTATGGC AGGCGAGCCG TGGAAGGNGG ATCAGGTCAC 
GGCGCTGGAG CTTTCCACGG TCCATGNATT GNGATGGCTG TTCTAGGCGG CTGTTGCCAA 
GCGTGATGGT ACGCTGGCTG GAGCATTGAT TTCTGGTGCC AAGGTGG 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

TTGGGTAAAG GGAGCAAGGA GAAGGCATGG AGAGGCTCAN GCTGGTCCTG GCCTACGACT 60
GGGCCAAGCT GTCGCCGGGG ATGGTGGAGA ACTGAAGCGG GACCTCCTCG AGGTCCTCCG 120
NCGTTACTTC NCCGTCCAGG AGGAGGGTCT TTCCGTGGTC TNGGAGGAGC GGGGGGAGAA 180
GATNCTCCTC ATGGTCNACA TCCC 204

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

TGGATTGGTC AGGAGCGGGT AGAGTGGCAC CATTGAGGGG ATATTCAAAA ATATTATTTT 60
GTCCTAAATG ATAGTTGCTG AGTTTTTCTT TGACCCATGA GTTATATTGG AGTTTATTTT 120
TTAACTTTCC AATCGCATGG ACATGTTAGA CTTATTTTCT GTTAATGATT NCTATTTTTA 180
TTAAATTGGA TTTGAGAAAT TGGTTNTTAT TATATCAATT TTTGGTATTT GTTGAGTTTG 240

ACATTATAGC TTAGTATGTG ACCA 264

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



TTACAACGAG GGGAAACTCC GTCTCTACAA AAATTAAAAA ATTAGCCAGG TGTGGTGGTG 60 

TGCACCCGCA ATCCCAGCTA CTTGGGAGGT TGAGACACAA GANTCACCTA NATGTGGGAG 120

GTCAAGGTTG CATGAGTCAT GATTGTGCCA CTGCACTCCA GCCTGGGTGA CAGACCGAGA 180

CCCTGCCTCA ANAGANAANG AATAGGAAGT TCAGAAATCN TGGNTGTGGN GCCCAGCAAT 240

CTGCATCTAT NCAACCCCTG CAGGCAANGC TGATGCAGCC TANGTTCAAG AGCTGCTGTT 300

TCTGGAGGCA GCAGTTNGGG CTTCCATCCA GTATCACGGC CACACTCGCA CNAGCCATCT 360

GTCCTCCGTN TGTNAC 376



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



TTACAACGAG GGGAAACTCC GTCTCTACAA AAATTAAAAA ATTAGCCAGG TGTGGTGGTG 60 

TGCACCTGTA ATCCCAGCTA CTTGGGCGGC TGAGACACAA GAACCACCTA AATGTGGGAG 120

GGTCAAGGTT GCATGAGTCA TGATCGCGCC ACTGCACTCC AGCCTGGGTG ACAGACTGAG 180

ACCCTGCCTC AAAAGAAAAA GAATAGGAAG TTCAGAAACC CTGGGTGTGG NGCCCAGCAA 240

TCTGCATTTA AACAATCCCT GCAGGCAATG CTGATGCAGC CTAAGTTCAA GAGCTGCTGT 300

TCTGGAGGCA GNAGTAAGGG CTTCCATCCA GCATCACGGN CAACACTGCA AAAGCACCTG 360

TCCTCGTTGG TA 372



(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TTCTGTCCAC ATCTACAAGT TTTATTTATT TTGTGGGTTT TCAGGGTGAC TAAGTTTTTC 60 

CCTACATTGA AAAGAGAAGT TGCTAAAAGG TGCACAGGAA ATCATTTTTT TAAGTGAATA 120 

TGATAATATG GGTCCGTGCT TAATACAACT GAGACATATT TGTTCTCTGT TTTTTTAGAG 180 
TCACCTCTTA AAGTCCAATC CCACAATGGT GAAAAAAAAA TAGAAAGTAT TTGTTCTACC 240

TTTAAGGAGA CTGCAGGGAT TCTCCTTGAA AACGGAGTAT GGAATCAATC TTAAATAAAT 300

ATGAAATTGG TTGGTCTTCT GGGATAAGAA ATTCCCAACT CAGTGTGCTG AAATTCACCT 360

GACTTTTTTT GGGAAAAAAT AGTCGAAAAT GTCAATTTGG TCCATAAAAT ACATGTTACT 420

ATTAAAAGAT ATTTAAAGAC AAATTCTTTC AGAGCTCTAA GATTGGTGTG GACAGAA 477

(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 438 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

TCTNCAACCT CTTGANTGTC AAAAACCTTN TAGGCTATCT CTAAAAGCTG ACTGGTATTC 60

ATTCCAGCAA AATCCCTCTA GTTTTTGGAG TTTCCTTTTA CTATCTGGGG CTGCCTGAGC 120

CACAAATGCC AAATTAAGAG CATGGCTATT TTCGGGGGCT GACAGGTCAA AAGGGGTGTA 180

AATCCGATAA GCCTCCTGGA GGTGCTCTAA AAACACTCCT GGTGACTCAT CATGCCCCTG 240

GACGACTTCA ATCGNCTTAG ACAAGTTTAT AGGTTTCTGG GCAGCTCCCT GAATACCCAC 300

GAGGAGATAC CGGTGGAAAT CGTCAAAAGT TCTCCCTCCA CTTGAGAAAT TTGGGTCCCA 360

ATTAGGTCCC AATTGGGTCT CTAATCACTA TTCCTCTAGC TTCCTCCTCC GGNCTATTGG 420

TTGATGTGAG GTTGAAGA 438

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

AAGAGGGTAC CAGCCCCAAG CCTTGACAAC TTCCATAGGG TGTCAAGCCT GTGGGTGCAC 60

AGAAGTCAAA AATTGAGTTT TGGGATCCTC AGCCTAGATT TCAGAGGATA TAAAGAAACA 120

CCTAACACCT AGATATTCAG ACAAAAGTTT ACTACAGGGA TGAAGCTTTC ACGGAAAACC 180

TCTACTAGGA AAGTACAGAA GAGAAATGTG GGTTTGGAGC CCCCAAACAG AATCCCCTCT 240

AGAACACTGC CTAATGAAAC TGTGAGAAGA TGGCCACTGT CATCCAGACA CCAGAATGAT 300

AGACCCACCA AAAACTTATG CCATATTGCC TATAAAACCT ACAGACACTC AATGCCAGCC 360

CCATGAAAAA AAAACTGAGA AGAAGACTGT NCCCTACAAT GCCACCGGAG CAGAACTGCC 420

CCAGGCCATG GAAGCACAGC TCTTATATCA ATGTGACCTG GATGTTGAGA CATGGAATCC 480

NANGAAATCN TTTTAANACT TCCACGGTTN AATGACTGCC CTATTANATT CNGAACTTAN 540

ATCCNGGCCT GTGACCTCTT TGCTTTGGCC ATTCCCCCTT TTTGGAATGG CTNTTTTTTT 600

CCCATGCCTG TNCCCTCTTA 620

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 100 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

TTACAACGAG GGGGTCAATG TCATAAATGT CACAATAAAA CAATCTCTTC TTTTTTTTTT 60

TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT 100

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:



TAGTCTATGC GCCGGACAGA GCAGAATTAA ATTGGAAGTT GCCCTCCGGA CTTTCTACCC 60

ACACTCTTCC TGAAAAGAGA AAGAAAAGAG GCAGGAAAGA GGTTAGGATT TCATTTTCAA 120

GAGTCAGCTA ATTAGGAGAG CAGAGTTTAG ACAGCAGTAG GCACCCCATG ATACAAACCA 180

TGGACAAAGT CCCTGTTTAG TAACTGCCAG ACATGATCCT GCTCAGGTTT TGAAATCTCT 240

CTGCCCATAA AAGATGGAGA GCAGGAGTGC CATCCACATC AACACGTGTC CAAGAAAGAG 300

TCTCAGGGAG ACAAGGGTAT CAAAAAACAA GATTCTTAAT GGGAAGGAAA TCAAACCAAA 360

AAATTAGATT TTTCTCTACA TATATATAAT ATACAGATAT TTAACACATT ATTCCAGAGG 420

TGGCTCCAGT CCTTGGGGCT TGAGAGATGG TGAAAACTTT TGTTCCACAT TAACTTCTGC 480

TCTCAAATTC TGAAGTATAT CAGAATGGGA CAGGCAATGT TTTGCTCCAC ACTGGGGCAC 540

AGACCCAAAT GGTTCTGTGC CCGAAGAAGA GAAGCCCGAA AGACATGAAG GATGCTTAAG 600

GGGGGTTGGG AAAGCCAAAT TGGTANTATC TTTTCCTCCT GCCTGTGTTC CNGAAGTCTC 660

CNCTGAAGGA ATTCTTAAAA CCCTTTGTGA GGAAATGCCC CCTTACCATG ACAANTGGTC 720

CCATTGCTTT TAGGGNGATG GAAACACCAA GGGTTTTGAT CC 762



(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

TAGTCTATGC GTGTATTAAC CTCCCCTCCC TCAGTAACAA CCAAAGAGGC AGGAGCTGTT 60 

ATTACCAACC CCATTTTACA GATGCATCAA TAATGACAGA GAAGTGAAGT GACTTGCGCA 120

CACAACCAGT AAATTGGCAG AGTCAGATTT GAATCCATGG AGTCTGGTCT GCACTTTCAA 180

TCACCGAATA CCCTTTCTAA GAAACGTGTG CTGAATGAGT GCATGGATAA ATCAGTGTCT 240

ACTCAACATC TTTGCCTAGA TATCCCGCAT AGACTA 276

(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 

TAGTAGTTGC CAAATATTTG AAAATTTACC CAGAAGTGAT TGAAAACTTT TTGGAAACAA 
AAACAAATAA AGCCAAAAGG TAAAATAAAA ATATCTTTGC ACTCTCGTTA TTACCTATCC 
ATAACTTTTT CACCGTAAGC TCTCCTGCTT GTTAGTGTAG TGTGGTTATA TTAAACTTTT 
TAGTTATTAT TTTTTATTCA CTTTTCCACT AGAAAGTCAT TATTGATTTA GCACACATGT 
TGATCTCATT TCATTTTTTC TTTTTATAGG CAAAATTTGA TGCTATGCAA CAAAAATACT 
CAAGCCCATT ATCTTTTTTC CCCCCGAAAT CTGAAAATTG CAGGGGACAG AGGGAAGTTA 
TCCCATTAAA AAATTGTAAA TATGTTCAGT TTATGTTTAA AAATGCACAA AACATAAGAA 
AATTGTGTTT ACTTGAGCTG CTGATTGTAA GCAGTTTTAT CTCAGGGGCA ACTACTA 

(2) INFORMATION FOR SEQ ID NO: 34: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:



TAGTAGTTGC CAATTCAGAT GATCAGAAAT GCTGCTTTCC TCAGCATTGT CTTGTTAAAC 60

CGCATGCCAT TTGGAACTTT GGCAGTGAGA AGCCAAAAGG AAGAGGTGAA TGACATATAT 120

ATATATATAT ATTCAATGAA AGTAAAATGT ATATGCTCAT ATACTTTCTA GTTATCAGAA 180

TGAGTTAAGC TTTATGCCAT TGGGCTGCTG CATATTTTAA TCAGAAGATA AAAGAAAATC 240

TGGGCATTTT TAGAATGTGA TACATGTTTT TTTAAAACTG TTAAATATTA TTTCGATATT 300

TGTCTAAGAA CCGGAATGTT CTTAAAATTT ACTAAAACAG TATTGTTTGA GGAAGAGAAA 360

ACTGTACTGT TTGCCATTAT TACAGTCGTA CAAGTGCATG TCAAGTCACC CACTCTCTCA 420

GGCATCAGTA TCCACCTCAT AGCTTTACAC ATTTTGACGG GGAATATTGC AGCATCCTCA 480

GGCCTGACAT CTGGGAAAGG CTCAGATCCA CCTACTGCTC CTTGCTCGTT GATTTGTTTT 540

AAAATATTGT GCCTGGTGTC ACTTTTAAGC CACAGCCCTG CCTAAAAGCC AGCAGAGAAC 600

AGAACCCGCA CCATTCTATA GGCAACTACT A 631

(2) INFORMATION FOR SEQ ID NO: 35: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 



TAGTAGTTGC CATCCCATAT TACAGAAGGC TCTGTATACA TGACTTATTT GGAAGTGATC 
TGTTTTCTCT CCAAACCCAT TTATCGTAAT TTCACCAGTC TTGGATCAAT CTTGGTTTCC 
ACTGATACCA TGAAACCTAC TTGGAGCAGA CATTGCACAG TTTTCTGTGG TAAAAACTAA 
AGGTTTATTT GCTAAGCTGT CATCTTATGC TTAGTATTTT TTTTTTACAG TGGGGAATTG 
CTGAGATTAC ATTTTGTTAT TCATTAGATA CTTTGGGATA ACTTGACACT GTCTTCTTTT 
TTTCGCTTTT AATTGCTATC ATCATGCTTT TGAAACAAGA ACACATTAGT CCTCAAGTAT 
TACATAAGCT TGCTTGTTAC GCCTGGTGGT TTAAAGGACT ATCTTTGGCC TCAGGTTCAC
AAGAATGGGC AAAGTGTTTC CTTATGTTCT GTAGTTCTCA ATAAAAGATT GCCAGGGGCC 
GGGTACTGTG GCTCGCACTG TAATCCCAGC ACTTTGGGAA GCTGAGGCTG GCGGATCATG 
TTAGGGCAGG TGTTCGAAAC CAGCCTGGGC AACTACTA 

(2) INFORMATION FOR SEQ ID NO: 36:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 583 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



TAGTAGTTGC CTGTAATCCC AGCAACTCAG GAGGCTGGGG CAGGAGAATC AGTTGAACCT 60

GGGAGGCAGA AGTTGTAATT AGCAAAGATC GCACCATTGC ACTTCAGCCT GGGCAACAAG 120

AGTGAGATTC CATCTCAAAA ACAAAAAAAA GAAAAAGAAA AGAAAAGGAA AAAACGTATA 180

AACCCAGCCA AAACAAAATG ATCATTCTTT TAATAAGCAA GACTAATTTA ATGTGTTTAT 240

TTAATCAAAG CAGTTGAATC TTCTGAGTTA TTGGTGAAAA TACCCATGTA GTTAATTTAG 300

GGTTCTTACT TGGGTGAACG TTTGATGTTC ACAGGTTATA AAATGGTTAA CAAGGAAAAT 360

GATGCATAAA GAATCTTATA AACTACTAAA AATAAATAAA ATATAAATGG ATAGGTGCTA 420

TGGATGGAGT TTTTGTGTAA TTTAAAATCT TGAAGTCATT TTGGATGCTC ATTGGTTGTC 480

TGGTAATTTC CATTAGGAAA AGGTTATGAT ATGGGGAAAC TGTTTCTGGA AATTGCGGAA 540

TGTTTCTCAT CTGTAAAATG CTAGTATCTC AGGGCAACTA CTA 583

(2) INFORMATION FOR SEQ ID NO: 37:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 716 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



GATCTACTAG TCATNTGGAT TCTATCCATG GCAGCTAAGC CTTTCTGAAT GGATTCTACT 60

GCTTTCTTGT TCTTTAATCC AGACCCTTAT ATATGTTTAT GTTCACAGGC AGGGCAATGT 120

TTAGTGAAAA CAATTCTAAA TTTTTTATTT TGCATTTTCA TGCTAATTTC CGTCACACTC 180

CAGCAGGCTT CCTGGGAGAA TAAGGAGAAA TACAGCTAAA GACATTGTCC CTGCTTACTT 240

ACAGCCTAAT GGTATGCAAA ACCACTTCAA TAAAGTAACA GGAAAAGTAC TAACCAGGTA 300

GAATGGACCA AAACTGATAT AGAAAAATCA GAGGAAGAGA GGAACAAATA TTTACTGAGT 360

CCTAGAATGT ACAAGGCTTT TTAATTACAT ATTTTATGTA AGGCCTGCAA AAAACAGGTG 420

AGTAATCAAC ATTTGTCCCA TTTTACATAT AAGGAAACTG AAGCTTAAAT TGAATAATTT 480

AATGCATAGA TTTTATAGTT AGACCATGTT CAGGTCCCTA TGTTATACTT ACTAGCTGTA 540

TGAATATGAG AAAATAATTT TGTTATTTTC TTGGCATCAG TATTTTCATC TGCAAAATAA 600

AGCTAAAGTT ATTTAGCAAA CAGTCAGCAT AGTGCCTGAT ACATAGTAGG TGCTCCAAAC 660

ATGATTACNC TANTATTNGG TATTANAAAA ATCCAATATA GGCNTGGATA AAACCG 716

(2) INFORMATION FOR SEQ ID NO: 38:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 688 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:

"cl?™ clilZlclc i^cllTJlt — TTCAA TOA^^TC. 

AAAAAAAAAA ACCAAAcSJ CcSSJJ?^ TTTTTTTTTA CCTTAAAAAA 

TTTCTCTTAC AACtcJc^ CA^r^J ^I^^''''^'^ AGAGTTCTAA AAAATTTACA 
AACAGAGAAA CTTGaSZ AgSS^^c?? lal^Zl"^ GTCTGTTAAA TCTTGGCATT 
TCTCCCCCTA TTGTTTTGCC JSSSJSg ^SgITL^J'^ ^TTTGTCTAA 

AGTGTAAACA ATGTATAGGA AGGAATATAT r^llll^l^ TGGAACATCC CCGTAGTTGA 
TGTAGGGACC TTCACAACTT clT^lrlrl ^J^^^^^^^^ TGCATCACAT ATGCATTACA 
GCCCAGGGTC ACCAT^Sq ^gSSSIp? ^J^J^'^''^^^ TTGAAGAGGA GGAGAGGACG 
TTTAGAAGAC CATgSS^ JgStSS?? SSnJ^'^^'' CAGAAGTGGC ACTGTTGAAA 
CGAGAAATTT CTT^ScA^? SG^IS^An nnlT''''^'''' ^^^^^^^^AA GAAGTGCCTC 
ACGAAGAAAA TGaSScJg CC^? TGGGTACGGT GGGTGACCCA 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

GGgS?g?S a?g™ ----- CTAGGAAACA TANTAAAATA 60

TGACAAATGC ATAT^cSS aScScI ACr^I^^ GTGCATANTT TTATGTATTT 120

TTTGGCCGGG CGTGGTGGGC GGTCGcSS rrr^nl AAGCTATTAC AATTAAAAAG 180

OOCACGCGGA TCACgIgg^C gSaSSS^ GACcSJS^^ SSSJSS r^^^^ 240

TCTCTACTAA AAATACGAAA AAATTACCCC GGcSgSgo rr^i^n^ TGAAAGTCCA 300

TACTCCGGAG GCTGAGGCAG GAGAATCrrP n-n^^l^-t ^°°°CGCCTG TAGTCCCAGC 360

CAACATCACG TCACTGCcJ? tcA^CTCrr ACACGGAGCT TGCAGTGTGC 420

AGAAAAATAC TAcSaS SJnaSSa ^fr^l ^OANTCCCG TCCTCANAAA 480

CCCCCTTACC ATTCATCTCA CCCaJSSJ ItIg^J^ nJ5^^'™ CCTCTTGGTA 540

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 475 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

TCTGTCCACA CCAATCTTAG AAGCTCTGAA AAGAATTTOT r-rT-a^Tv . . 60

T««T3™x ™..O.CC «.™.C„ t"S?JSSI SSSS? 120

GAATTTCAGC ACACTGAGTT GGGAATTTCT TATCCCAGAA GACCAACCAA TTTCATATTT 180

ATTTAAGATT GATTCCATAC TCCGTTTTCA AGGAGAATCC CTGCAGTCTC CTTAAAGGTA 240

GAACAAATAC TTCCTATTTT TTTTTCACCA TTGTGGGATT GGACTTTAAG AGGTGACTCT 300

AAAAAAACAG AGAACAAATA TGTCTCAGTT GTATTAAGCA CGGACCCATA TTATCATATT 360

CACTTAAAAA AATGATTTCC TGTGCACCTT TTGGCAACTT CTCTTTTCAA TGTAGGGAAA 420

AACTTAGTCA CCCTGAAAAC CCACAAAATA AATAAAACTT GTAGATGTGG ACAGA 475

(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

TAAGAGGGTA CATCGGGTAA GAACGTAGGC ACATCTAGAG CTTAGAGAAG TCTGGGGTAG 60 

GAAAAAAATC TAAGTATTTA TAAGGGTATA GGTAACATTT AAAAGTAGGG CTAGCTGACA 120

TTATTTAGAA AGAACACATA CGGAGAGATA AGGGCAAAGG ACTAAGACCA GAGGAACACT 180

AATATTTAGT GATCACTTCC ATTCTTGGTA AAAATAGTAA CTTTTAAGTT AGCTTCAAGG 240

AAGATTTTTG GCCATGATTA GTTGTCAAAA GTTAGTTCTC TTGGGTTTAT ATTACTAATT 300

TTGTTTTAAG ATCCTTGTTA GTGCTTTAAT AAAGTCATGT TATATCAAAC GCTCTAAAAC 360

ATTGTAGCAT GTTAAATGTC ACAATATACT TACCATTTGT TGTATATGGC TGTACCCTCT 420

CTA 423

(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

TCTCCTAGGC TAATGTGTGT GTTTCTGTAA AAGTAAAAAG TTAAAAATTT TAAAAATAGA 60

AAAAAGCTTA TAGAATAAGA ATATGAAGAA AGAAAATATT TTTGTACATT TGCACAATGA 120

GTTTATGTTT TAAGCTAAGT GTTATTACAA AAGAGCCAAA AAGGTTTTAA AAATTAAAAC 180

GTTTGTAAAG TTACAGTACC CTTATGTTAA TTTATAATTG AAGAAAGAAA AACTTTTTTT 240

TATAAATGTA GTGTAGCCTA AGCATACAGT ATTTATAAAG TCTGGCAGTG TTCAATAATG 300

TCCTAGGCCT TCACATTCAC TCACTGACTC ACCCAGAGCA ACTTCCAGTC CTGTAAGCTC 360

CATTCGTGGT AAGTGCCCTA TACAGGTGCA CCATTTATTT TACAGTATTT TTACTGTACC 420

TTCTCTATGT TTCCATATGT TTCGATATAC AAATACCACT GGTTACTATN GCCCNACAGG 480

TAATTCCAGT AACACGGCCT GTATACGTCT GGTANCCCTA GNGAAGA 527

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43:

iiE€ ~B =^ -™ 
ES = ~- — — ? SIS 

ATTTACGGTT CAAAAGAAGT TGTAATATTG TfirTTor'n 7\r> n^Tv^, ^^^J-^^iiaAi 
TTCCTACTAC TATTATATAA TAAaJI^JS J^^^^^^^^ ACAGAGAACC AGTTATTAAC 

(2) INFORMATION FOR SEQ ID NO: 44: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

mmmmmm 

CACCCTTGAC CAACGCTTTG CGAAATGGAT TTGTGATGGC GGCAACTACT AA 
(2) INFORMATION FOR SEQ ID NO: 45: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 



SJSJSSg rl.^^^r''^ GAGTGCTTGC TCAACGAGCG TTGAACATGG CGGATTGTCT 

mmmmm^^ 

— - ~i SEi ~ -™ 

cSJISSJ ss.— ~s -~ «~ 



(2) INFORMATION FOR SEQ ID NO: 46:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 908 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GAGCGAAAGA CCGAGGGCAG NGNNTANGNG CGANGAAGCG GAGAGGGCCA AAAAGCAACC 60

GCTTTCCCCG GGGGGTGCCG ATTCATTAAG GCAGGTGGAG GACAGGTTTC CCGATGGAAG 120

GCGGCAGGGG CGCAAGCAAT TAATGTGAGT AGGCCATTCA TTAGCACCCG GGCTTAACAT 180

TTAAGCTTCG GGTTGGTATG TGGTGGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA 240

CAGCTATGAC CATGATTACG CCAAGCTATT TAGGTGACAT TATAGAATAA CTCAAGTTAT 300

GCATCAAGCT TGGTACCGAG TTCGGATCCA CTAGTAACGG CCGCCAGTGT GTGGAATTCG 360

GCTTAGTAGT TGCCGACCAT GGAGTGCTAC CTAGGCTAGA ATACCTGAGY TCCTCCCTAG 420

CCTCACTCAC ATTAAATTGT ATCTTTTCTA CATTAGATGT CCTCAGCGCC TTATTTCTGC 480

TGGACWATCG ATAAATTAAT CCTGATAGGA TGATAGCAGC AGATTAATTA CTGAGAGTAT 540

GTTAATGTGT CATCCCTCCT ATATAACGTA TTTGCATTTT AATGGAGCAA TTCTGGAGAT 600

AATCCCTGAA GGCAAAGGAA TGAATCTTGA GGGTGAGAAA GCCAGAATCA GTGTCCAGCT 660

GCAGTTGTGG GAGAAGGTGA TATTATGTAT GTCTCAGAAG TGACACCATA TGGGCAACTA 720

CTAAGCCCGA ATTCCAGCAC ACTGGCGGGC GTTACTAATG GATCCGAGCT CGGTACC7VAG 780

CTTGATGCAT AGCTTGAGTA TCTATAGTGT CACTAAATAG CCTGGCGTTA TCATGGTCAT 840

AGCTGTTTCC TGTGTGAAAT TGTTATCCGC TCCCAATTCC CCCCACCATA CGAGCCGGAA 900

CATAAAGT 908

(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:

TGCCAACAAG GAAAGTTTTA AATTTCCCCT TGAGGATTCT TGGTGATCAT CAAATTCAGT 60 

GGTTTTTAAG GTTGTTTTCT GTCAAATAAC TCTAACTTTA AGCCAAACAG TATATGGAAG 120 

CACAGATAKA ATATTACACA GATAAAAGAG GAGTTGATCT AAAGTARAGA TAGTTGGGGG 180 

CTTTAATTTC TGGAACCTAG GTCTCCCCAT CTTCTTCTGT GCTGAGGAAC TTCTTGGAAG 240

CGGGGATTCT AAAGTTCTTT GGAAGACAGT TTGAAAACCA CCATGTTGTT CTCAGTACCT 300

TTATTTTTAA AAAGTAGGTG AACATTTTGA GAGAGAAAAG GGGTTGGTTG AGATGAAGTC 360 

CCCCCCCCCC CTTTTTTTTT TTTTAGCTGA AATAGATACC CTATGTTNAA RGAARGGATT 420 

ATTATTTACC ATGCCAYTAR SCACATGCTC TTTGATGGGC NYCTCCSTAC CCTCCTTAAG 480

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

AAGAGGGTAC CGAGTGGAAT TTCCGCTTCA CTAGTCTGGT GTGGCTAGTC GGTTTCGTGG 
TGGCCAACAT TACGAACTTC CAACTCAACC GTTCTTGGAC GTTCAAGCGG GAGTACCGGC 
GAGGATGGTG GCGTGAATTC TGGCCTTTCT TTGCCGTGGG ATCGGTAGCC GCCATCATCG 
GTATGTTTAT CAAGATCTTC TTTACTAACC CGACCTCTCC GATTTACCTG CCCGAGCCGT 
GGTTTAACGA GGGGAGGGGG ATCCAGTCAC GCGAGTACTG GTCCCAGATC TTCGCCATCG 
TCGTGACAAT GCCTATCAAC TTCGTCGTCA ATAAGTTGTG GACCTTCCGA ACGGTGAAGC 
ACTCCGAAAA CGTCCGGTGG CTGCTGTGCG GTGACTCCCA AAATCTTGAT AACAACAAGG 
TAACCGAATC GCGCTAAGGA ACCCCGGCAT CTCGGGTACT CTGCATATGC GTACCCCTTA 
AGCCGAATTC CAGCACACTG GCGGCCGTTA CTAATTGGAT CCGAACTCCG TAACCAAGCC 
TGATGCGTAA CTTGAGTTAT TCTATAGTGT CCCTAAAATA ACCTGGCGTT A 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

AAGAGGGTAC CTGCCTTGAA ATTTAAATGT CTAAGGAAAR TGGGAGATGA TTAAGAGTTG 
GTGTGGCYTA GTCACACCAA AATGTATTTA TTACATCCTG CTCCTTTCTA GTTGACAGGA 
AAGAAAGCTG CTGTGGGGAA AGGAGGGATA AATACTGAAG GGATTTACTA AACAAATGTC 
CATCACAGAG TTTTCCTTTT TTTTTTTTTG AGACAGAGTC TTGCTCTGTC ACCCAGGCTG 
GAATGAAGWG GTATGATCTC AGTTGAATGC AACCTCTACC TCCTAGGTTC AAGCGATTCT 
CATGCCTCAG CCTCCTGAGC AGCTGGGACT ATAGGCGCAT GCTACCATGC CAGGCTAATT 
TTTATATTTT TATTAGAGAC GGGGTGTTGC CATGTTGGCC AGGCAGGTCT CGAACTCCTG 
GGCCTCAGAT GATCTGCCCC ACCGTACCCT CTTA 

(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

AAGAGGGTAC CAAAAAAAAG AAAAAGGAAA AAAAGAAAAA CAACTTGTAT AAGGCTTTCT 
GCTGCATACA GCTTTTTTTT TTTAAATAAA TGGTGCCAAC AAATCTT^^TT GCATTCACAC 
CAATTGCTGG TTTTGAAATC GTACTCTTCA AAGGTATTTG TGCAGATCAA TCCAATAGTG 
ATGCCCCGTA GGTTTTGTGG ACTGCCCACG TTGTCTACCT TCTCATGTAG GAGCCATTGA 
GAGACTGTTT GGACATGCCT GTGTTCATGT AGCCGTGATG TCCGGGGGCC GTGTACATCA 
TGTTACCGTG GGGTGGGGTC TGCATTGGCT GCTGGGCATA TGGCTGGGTG CCCATCATGC 
CCATCTGCAT CTGCATAGGG TATTGGGGCG TTTGATCCAT ATAGCCATGA TTGCTGTGGT 
AGCCACTGTT CATCATTGGC TGGGACATGC TGTTACCCTC TTA 
(2) INFORMATION FOR SEQ ID NO: 51:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51:

CTTCAACCTC CCAAAGTGCT GGGATTACAG GACTGAGCCA CCACGCTCAG CCTAAGCCTC 

TTTTTCACTA CCCTCTAAGC GATCTACCAC AGTGATGAGG GGCTAAAGAG CAGTGCAATT 

TGATTACAAT AATGGAACTT AGATTTATTA ATTAACAATT TTTCCTTAGC ATGTTGGTTC 

CATAATTATT AAGAGTATGG ACTTACTTAG AAATGAGCTT TCATTTTAAG AATTTCATCT 

TTGACCTTCT CTATTAGTCT GAGCAGTATG ACACTATACG TATTTTATTT AACTAACCTA 

CCTTGAGCTA TTACTTTTTA AAAGGCTATA TACATGAATG TGTATTGTCA ACTGTAAAGC 
CCCACAGTAT TTAATTATAT CATGATGTCT TTGAGGTTG 

(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 392 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

CTTCAACCTC AATCAACCTT GGTAATTGAT AAAATCATCA CTTAACTTTC TGATATAATG 60

GCAATAATTA TCTGAGAAAA AAAAGTGGTG AAAGATTAAA CTTGCATTTC TCTCAGAATC 120

TTGAAGGATA TTTGAATAAT TCAAAAGCGG AATCAGTAGT ATCAGCCGAA GAAACTCACT 180

TAGCTAGAAC GTTGGACCCA TGGATCTAAG TCCCTGCCCT TCCACTAACC AGCTGATTGG 240

TTTTGTGTAA ACCTCCTACA CGCTTGGGCT TGGTCGCCTC ATTTGTCAAA GTAAAGGCTG 300

AAATAGGAAG ATAATGAACC GTGTCTTTTT GGTCTCTTTT CCATCCATTA CTCTGATTTT 360

ACAAAGAGGC CTGTATTCCC CTGGTGAGGT TG 392

(2) INFORMATION FOR SEQ ID NO: 53:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

TTCGGGTGAT GCCTCCTCAG GCTACAGTGA AGACTGGATT ACAGAAAGGT GCCAGCGAGA 60 

TTTCAGATTC CTGTAAACCT CTAAAGAAAA GGAGTCGCGC CTCAACTGAT GTAGAAATGA 120

CTAGTTCAGC ATACNGAGAC ACNTCTGACT CCGATTCTAG AGGACTGAGT GACCTGCAN 179 



(2) INFORMATION FOR SEQ ID NO:54: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 112 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 



TTCGGGTGAT GCCTCCTCAG GCTACATCAT NATAGAAGCA AAGTAGAANA ATCNNGTTTG 60 
TGCATTTTCC CACANACAAA ATTCAAATGA NTGGAAGAAA TTGGGANAGT ^l'''^^™ 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:

TGAGCTTCCG CTTCTGACAA CTCAATAGAT AATCAAAGGA CAACTTTAAC AGGGATTCAC 

AAAGGAGTAT ATCCAAATGC CAATAAACAT ATAAAAAGGA ATTCAGCTTC ATCATCATCA

GAAGWATGCA AATTAAAACC ATAATGAGAA ACCACTATGT CCCACTAGAA TAGATAAAAT
CTTAAAAGAC TGGTAAAACC AAGTGTTGGT AAGGCAAGAG GAGCA 225

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GCTCCTCTTG CCTTACCAAC ACATTCTCAA AAACCTGTTA GAGTCCTAAG CATTCTCCTG 

TTAGTATTGG GATTTTACCC CTGTCCTATA AAGATGTTAT GTACCAAAAA TGAAGTGGAG

GGCCATACCC TGAGGGAGGG GAGGGATCTC TAGTGTTGTC AGAAGCGGAA GCTCA 
(2) INFORMATION FOR SEQ ID NO:57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
AGCCATTTAC CACCCATGGA TGAATGGATT TTGTAATTCT AGCTGTTGTA TTTTGTGAAT 
TTGTTAATTT TGTTGTTTTT CTGTGAAACA CATACATTGG ATATGGGAGG TAAAGGAGTG
TCCCAGTTGC TCCTGGTCAC TCCCTTTATA GCCATTACTG TCTTGTTTCT TGTAACTCAG 
GTTAGGTTTT GGTCTCTCTT GCTCCACTGC AAAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GTTCGAAGGT GAACGTGTAG GTAGCGGATC TCACAACTGG GGAACTGTCA AAGACGAATT 

AACTGACTTG GATCAATCAA ATGTGACTGA GGAAACACCT GAAGGTGAAG AACATCATCC 

AGTGGCAGAC ACTGAAAATA AGGAGAATGA AGTTGAAGAG GTAAAAGAGG AGGGTCCAAA 

AGAGATGACT TTGGATGGGT GGTAAATGGC T 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GCTCCTCTTG CCTTACCAAC TTTGCACCCA TCATCAACCA TGTGGCCAGG TTTGCAGCCC 

AGGCTGCACA TCAGGGGACT GCCTCGCAAT ACTTCATGCT GTTGCTGCTG ACTGATGGTG 

CTGTGACGGA TGTGGAAGCC ACACGTGAGG CTGTGGTGCG TGCCTCGAAC CTGCCCATGT 
CAGTGATCAT TATGGGTGGT AAATGGCT 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

AGCCATTTAC CACCCATACT AAATTCTAGT TCAAACTCCA ACTTCTTCCA TAAAACATCT 
AACCACTGAC ACCAGTTGGC AATAGCTTCT TCCTTCTTTA ACCTCTTAGA GTATTTATGG 
TCAATGCCAC ACATTTCTGC AACTGAATAA AGTTGGTAAG GCAAGAGGAG C 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:

CGGGTGATGC CTCCTCAGGC TTTGGTGTGT CCACTCNACT CACTGGCCTC TTCTCCAGCA 60 
SSS^cJJ? Totr"""" CNCC ACACGCNGCT CAGGGTGGGO TGGgS^cS 

134 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

AGAGGGTACA TATGCAACAG TATATAAAGG AAGAAGTGCA CTGAGAGGAA CTTCATCAAG 60

GCCATTTAAT CAATAAGTGA TAGAGTCAAG GCTCAACCCA GGTGTGACGG ATTCcIgSJc 120
CCAAGCTCCT TACTGGTACC CTCTT 145

(2) INFORMATION FOR SEQ ID NO: 63:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

TGCACTGAGA GGAATTCAAA GGGTTTATGC CAAAGAACAA ACCAGTCCTC TGCAGCCTAA 
rrSl^Jnl^ TTTGGGCTGC GAAGCCATGT AGAGGGCGAT CAGGCAGTAG ATGGTCCCTC 
CCACAGTCAG CGCCATGGTG GTCCGGTAAA GCATTTGGTC AGGCAGGCCT CGTTTCAGGT 
AGACGGGCAC ACATCAGCTT TCTGGAAAAA CTTTTGTAGC TCTGGAGCTT TGTTTTTCCC 
AGCATAATCA TACACTGTGG AATCGGAGGT CAGTTTAGTT GGTAAGGCAA GAGGAGC 297

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64:
GCACTGAGAG GAACTTCCAA TACTATGTTG AATAGGAGTG GTGAGAGAGG GCATCCTTGT 60
CTTGTGCCGG TTTTCAAAGG GAATGCTTCC AGCTTTTGCC CATTCAGTAT AATATTAAAG
AATGTTTTAC CATTTTCTGT CTTGCCTGTT TTTCTGTGTT TTTGTTGGTC TCTTCATTCT 
CCATTTTTAG GCCTTTACAT GTTAGGAATA TATTTCTTTT AATGATACTT CACCTTTGGT 
ATCTTTTGTG AGACTCTACT CATAGTGTGA TAAGCACTGG GTTGGTAAGG CAAGAGGAGC 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



GCTCCTCTTG CCTTACCAAC TCACCCAGTA TGTCAGCAAT TTTATCRGCT TTACCTACGA 

AACAGCCTGT ATCCAAACAC TTAACACACT CACCTGAAAA GTTCAGGCAA CAATCGCCTT 

CTCATGGGTC TCTCTGCTCC AGTTCTGAAC CTTTCTCTTT TCCTAGAACA TGCATTTARG 180

TCGATAGAAG TTCCTCTCAG TGC 



(2) INFORMATION FOR SEQ ID NO: 66: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 344 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

TACGGGGACC CCTGCATTGA GAAAGCGAGA CTCACTCTGA AGCTGAAATG CTGTTGCCCT 
TGCAGTGCTG GTAGCAGGAG TTCTGTGCTT TGTGGGCTAA GGCTCCTGGA TGACCCCTGA 
CATGGAGAAG GCAGAGTTGT GTGCCCCTTC TCATGGCCTC GTCAAGGCAT CATGGACTGC 
CACACACAAA ATGCCGTTTT TATTAACGAC ATGAAATTGA AGGAGAGAAC ACAATTCACT 
GATGTGGCTC GTAACCATGG ATATGGTCAC ATACAGAGGT GTGATTATGT AAAGGTTAAT 
TCCACCCACC TCATGTGGAA ACTAGCCTCA ATGCAGGGGT CCCA 

(2) INFORMATION FOR SEQ ID NO:67: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

GCACTGAGAG GAACTTCGTA GGGAGGTTGA ACTGGCTGCT GAGGAGGGGG AACAACAGGG 
TAACCAGACT GATAGCCATT GGATGGATAA TATGGTGGTT GAGGAGGGAC ACTACTTATA 
GCAGAGGGTT GTGTATAGCC TGAGGAGGCA TCACCCG 
(2) INFORMATION FOR SEQ ID NO: 68:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 137 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:

S^™ sss™ ^^^^^ - 

TGAGGAGGCA TCACCCG CACGCCTGTA ATCCCAGAAC TTTGGGAGCC 12 0 

137 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:

SSS^^ l^^^i ~- -™ eo 

GAAGTTCCTC TCAGTGC TTCTACCTGT CAAAATCCAT TTCAAATGTT 12 0 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

== = = =3 — == 

CCAGGGTMYA SGGACMASGG TGGGAYTYCA JcaJJ^SS "^^^^^^^^^ NNAAGTACTR 
(2) INFORMATION FOR SEQ ID NO: 71: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:



CGTTAGGGTC TCTATCCACT GCTAAACCAT ACACCTGGGT AAACAGGGAC CATTTAACAT 
TCCCANCTAA ATATGCCAAG TGACTTCACA TGTTTATCTT AAAGATGTCC AAAACGCAAC 
TGATTTTCTC CCCTAAACCT GTGATGGTGG GATGATTAAN CCTGAGTGGT CTACAGCAAG 
TTAAGTGCAA GGTGCTAAAT GAANGTGACC TGAGATACAG CATCTACAAG GCAGTACCTC 
TCAACNCAGG GCAACTTTGC TTCTCANAGG GCATTTAGCA GTGTCTGAAG TAATTTCTGT 
ATTACAACTC ACGGGGCGGG GGGTGAATAT CTANTGGANA GNAGACCCTA ACG 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

GCACTGAGAG GAACTTCCAA TACYATKATC AGAGTGAACA RGCARCCYAC AGAACAGGAG 
AAAATGTTYG CAATCTCTCC ATCTGACAAA AGGCTAATAT CCAGAWTCTA AWAGGAACTT 
AAACAAATTT ATGAGAAAAG AACARACAAC CTCAWCAAAA AGTGGGTGAA GGAWATGCTS 
AAARGAAGAC ATYTATTCAG CCAGTAAACA YATGAAAAAA AGGCTCATSA TCACTGAWCA 
TTAGAGAAAT GCAAATCAAA ACCACAATGA GATACCATCT YAYRCCAGTT AGAAYGGTGA
TCATTAAAAR STCAGGAAAC AACAGATGCT GGACAAGGTG TCA 

(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

GCACTGAGAG GAACTTCAGA GAGAGAGAGA GAGTTCCACC CTGTACTTGG GGAGAGAAAC 
AGAAGGTGAG AAAGTCTTTG GTTCTGAAGC AGCTTCTAAG ATCTTTTCAT TTGCTTCATT 
TCAAAGTTCC CATGCTGCCA AAGTGCCATC CTTTGGGGTA CTGTTTTCTG AGCTCCAGTG 
ATAACTCATT TATACAAGGG AGATACCCAG AAAAAAAGTG AGCAAATCTT AAAAAGGTGG 
CTTGAGTTCA GCCTTAAATA CCATCTTGAA ATGACACAGA GAAAGAANGA TGTTGGGTGG 
GAGTGGATAG AGACCCTAAC G 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74:

GCACTGAGAG GAACTTCAGA GAGAGAGAGA GAGTTCCACC CTGTACTTGG rr^n^n.^.r.



(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 



mmmmmm 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:

m m = ~ = 

GTGC CCCTAGCCCC TTACACCCCC AACAGGCTCT AGTGTGTGAA GTTCCTCTCA 

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
CGTTAGGGTC TCTATCCACT GAAATCTGAA GCACAGGAGG AAGAGAAGCA GTYCTAGTGA
GATGGCAAGT TCWTTTACCA CACTCTTTAA CATTTYGTTT AGTTTTAACC TTTATTTATG
GATAATAAAG GTTAATATTA ATAATGATTT ATTTTAAGGC ATTCCCRAAT TTGCATAATT 
CTCCTTTTGG AGATACCCTT TTATCTCCAG TGCAAGTCTG GATCAAAGTG ATASAMAGAA 
GTTCCTCTCA GTGC 

(2) INFORMATION FOR SEQ ID NO: 78:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 355 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

TTCGATACAG GCAAACATGA ACTGCAGGAG GGTGGTGACG ATCATGATGT TGCCGATGGT 

CCGGATGGNC ACGAAGACGC ACTGGANCAC GTGCTTACGT CCTTTTGCTC TGTTGATGGC

CCTGAGGGGA CGCAGGACCC TTATGACCCT CAGAATCTTC ACAACGGGAG ATGGCACTGG 

ATTGANTCCC ANTGACACCA GAGACACCCC AACCACCAGN ATATCANTAT ATTGATGTAG 

TTCCTGTAGA NGGCCCCCTT GTGGAGGAAA GCTCCATNAG TTGGTCATCT TCAACAGGAT 

CTCAACAGTT TCCGATGGCT GTGATGGGCA TAGTCATANT TAACCNTGTN TCGAA 

(2) INFORMATION FOR SEQ ID NO: 79: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 



TAAGAGGGTA CCAGCAGAAA GGTTAGTATC ATCAGATAGC ATCTTATACG AGTAATATGC 

CTGCTATTTG AAGTGTAATT GAGAAGGAAA ATTTTAGCGT GCTCACTGAC CTGCCTGTAG 

CCCCAGTGAC AGCTAGGATG TGCATTCTCC AGCCATCAAG AGACTGAGTC AAGTTGTTCC 

TTAAGTCAGA ACAGCAGACT CAGCTCTGAC ATTCTGATTC GAATGACACT GTTCAGGAAT 

CGGAATCCTG TCGATTAGAC TGGACAGCTT GTGGCAAGTG AATTTGCCTG TAACAAGCCA 

GATTTTTTAA AATTTATATT GTAAATAATG TGTGTGTGTG TGTGTGTATA TATATATATA 
TGTACAGTTA TCTAAGTTAA TTTAAAAGTT GTTTGGTACC CTCTTA 

(2) INFORMATION FOR SEQ ID NO: 80: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

^^^'^T'^TTTT TTTACTCGGC TCAGTCTAAT CCTTTTTGTA GTCACTCATA GGCCAGACTT 60 
AGGGCTAGGA TGATGATTAA TAAGAGGGAT GACATAACTA TTAGTGGCAG GTTAGTTGTT 120
TGTAGGGCTC ATGGTAGGGG TAAAAGGAGG GCAATTTCTA GATCAAATAA TAAGAAGGTA
ATAGCTACTA AGAAGAATTT TATGGAGAAA GGGACGCGGG CGGgSJJ^ I^gJJgSg 
CCGCACTCGT AAGGGGTGGA TTTTTCTATG TAGCCGTTGA GTTGTGgSg TCpSJIt^I 
ATAATTATTA GTAGTAAGCC TAGGAGA ^i^v^iA^^ iCAAAATGTA 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

TAGTCTATGC GGTTGATTCG GCAATCCATT ATTTGCTGGA TTTTGTCATG TGT-TTGCCa 
^r^^^f TAATTTATTA TGCATTTATG CTTGTATCTC CTAAgJcJtc IZtII^TC 

catgcttttt atgttttgtc tgacataaac tcttatcaga gcccttSca SSSJ^S 

CAATAAATAT TAACACAGTC TACATTTATT TGGTGAATAT TGCATATCTG SSaSSH 

J?o™ - 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

^r^^^J^^r CTACTCCCAC TAATAGCTTT TTGATGACTT CTAGCAAGCC TCGCTAACCT 
CGCCTTACCC CCCACTATTA ACCTACTGGG AGAACTCTCT GTGCTAGTAA CCACcJJcS 
CTGATCAAAT ATCACTCTCC TACTTACAGG ACTCAACATA CTaSJacJS cSSSSc 
CCTCTACATA TTTACCACAA CACAATGGGG CTCACTCACC CACCACa5Ja IcZcIt^ 
ACCCTCATTC ACACGAGAAA ACACCCTCAT GTTCATACAC CTATCCCcS ^cSJ^ 
ATCCCTCAAC CCCGACATCA TTACCGGGTT TTCCTCTT ^^^^^^CA TTCTCCTCCT 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

JSgaSSg SS™?? ^^^^^^^^^ AAAAAAAAAG AAAAATATCA AGGAATAAAA 
ATAGACTTTG AACAAAAAGG AACATTTGCT GGCCTGAGGA GGCATCACCC G

(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 

TCGGGTGATG CCTCCTCAGG CCAAGAAGAT AAAGCTTCAG ACCCCTAACA CATTTCCAAA 
AAGGAAGAAA GGAGAAAAAA GGGCATCATC CCCGTTCCGA AGGGTCAGGG AGGAGGAAAT 
TGAGGTGGAT TCACGAGTTG CGGACAACTC CTTTGATGCC AAGCGAGGTG CAGCCGGAGA 
CTGGGGAGAG CGAGCCAATC AGGTTTTGAA GTTCCTCTCA GTGC 

(2) INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 348 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 

GCACTGAGAG GAACTTCGTT GGAAACGGGT TTTTTTCATG TAAGGCTAGA CAGAAGAATT 
CTCAGTAACT TCCTTGTGTT GTGTGTATTC AACTCACASA GTTGAACGAT CCTTTACACA 
GAGCAGACTT GTAACACTCT TWTTGTGGAA TTTGCAAGTG GAGATTTCAG SCGCTTTGAA 
GTSAAAGGTA GAAAAGGAAA TATCTTCCTA TAAAAACTAG ACAGT^TGAT TCTCAGAAAC 
TCCTTTGTGA TGTGTGCGTT CAACTCACAG AGTTTAACCT TTCWTTTCAT AGAAGCAGTT 
AGGAAACACT CTGTTTGTAA AGTCTGCAAG TGGATAGAGA CCCTAACG 

(2) INFORMATION FOR SEQ ID NO: 86: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

GCACTGAGAG GAACTTCYTT GTGWTGTKTG YATTCAACTC ACAGAGTTGA ASSWTSMTTT 
ACABAGWKCA GGCTTKCAAA CACTCTTTTT GTMGAATYTG CAAGWGGAKA TTTSRRCCRC 
TTTGWGGYCW WYSKTMGAAW MGGRWATATC TTCWYATMRA AMCTAGACAG AAKSATTCTC 
AKAAWSTYYY YTGTGAWGWS TGCRTTCAAC TCACAGAGKT KAACMWTYCT KYTSATRGAG 
CAGTTWKGAA ACTCTMTTTC TTTGGATTCT GCAAGTGGAT AGAGACCCTA ACG 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
CTCCTAGGCT 

10 

(2) INFORMATION FOR SEQ ID NO:88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 
AGTAGTTGCC 

10 

(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89:
TTCCGTTATG C 

11 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:
TGGTAAAGGG 

10 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:
TCGGTCATAG 

10 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:
TACAACGAGG

10

(2) INFORMATION FOR SEQ ID NO: 93:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 
TGGATTGGTC 

10 

(2) INFORMATION FOR SEQ ID NO: 94:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94:
CTTTCTACCC 

10 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95:
TTTTGGCTCC

10 

(2) INFORMATION FOR SEQ ID NO:96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:
GGAACCAATC

10

(2) INFORMATION FOR SEQ ID NO: 97:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
TCGATACAGG 

10 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 
GGTACTAAGG 

10 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
AGTCTATGCG 

10

(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
CTATCCATGG 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TCTGTCCACA 

(2) INFORMATION FOR SEQ ID NO: 102:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
AAGAGGGTAC 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CTTCAACCTC 

(2) INFORMATION FOR SEQ ID NO:104: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
GCTCCTCTTG CCTTACCAAC 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GTAAGTCGAG CAGTGTGATG 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:
GTAAGTCGAG CAGTCTGATG

(2) INFORMATION FOR SEQ ID NO: 107:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
GACTTAGTGG AAAGAATGTA 
(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:
GTAATTCCGC CAACCGTAGT

(2) INFORMATION FOR SEQ ID NO: 109:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
ATGGTTGATC GATAGTGGAA 
(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
ACGGGGACCC CTGCATTGAG 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
TATTCTAGAC CATTCGCTAC 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112:
ACATAACCAC TTTAGCGTTC 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
CGGGTGATGC CTCCTCAGGC 
(2) INFORMATION FOR SEQ ID NO:114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
AGCATGTTGA GCCCAGACAC 

(2) INFORMATION FOR SEQ ID NO: 115:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GACACCTTGT CCAGCATCTG

(2) INFORMATION FOR SEQ ID NO: 116:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
TACGCTGCAA CACTGTGGAG

(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:
CGTTAGGGTC TCTATCCACT 
(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:118: 
AGACTGACTC ATGTCCCCTA 
(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
TCATCGCTCG GTGACTCAAG 
(2) INFORMATION FOR SEQ ID NO: 120:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:
CAAGATTCCA TAGGCTGACC 
(2) INFORMATION FOR SEQ ID NO: 121:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
ACGTACTGGT CTTGAAGGTC 
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
GACGCTTGGC CACTTGACAC 

(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 
GTATCGACGT AGTGGTCTCC 
(2) INFORMATION FOR SEQ ID NO: 124:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
TAGTGACATT ACGACGCTGG 
(2) INFORMATION FOR SEQ ID NO: 125:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 




(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 
CGGGTGATGC CTCCTCAGGC 20

(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:
ATGGCTATTT TCGGGGGCTG ACA 23

(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
CCGGTATCTC CTCGTGGGTA TT 22

(2) INFORMATION FOR SEQ ID NO: 128:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 
CTGCCTGAGC CACAAATG 18



(2) INFORMATION FOR SEQ ID NO: 129:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:
CCGGAGGAGG AAGCTAGAGG AATA

(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130:
TTTTTTTTTT TTAG 

(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Ser Ser Gly Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val
1               5                   10                  15

Gly Ile

(2) INFORMATION FOR SEQ ID NO: 132:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132:

Gln Gly Ala Ala Gln Lys Pro Ile Asn Leu Ser Lys Xaa Ile Glu Val
1               5                   10                  15

Val Gln Gly His Asp Glu
            20

(2) INFORMATION FOR SEQ ID NO: 133:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133: 

Ser Pro Gly Val Phe Leu Glu His Leu Gln Glu Ala Tyr Arg Ile Tyr
1               5                   10                  15

Thr Pro Phe Asp Leu Ser Ala
            20

(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:134: 

Tyr Leu Leu Val Gly Ile Gln Gly Ala
1 5

(2) INFORMATION FOR SEQ ID NO: 135:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

Gly Ala Ala Gln Lys Pro Ile Asn Leu
1 5 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136:

Asn Leu Ser Lys Xaa Ile Glu Val Val
1 5 

(2) INFORMATION FOR SEQ ID NO: 137:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137:

Glu Val Val Gln Gly His Asp Glu Ser
1 5 

(2) INFORMATION FOR SEQ ID NO: 138:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:

His Leu Gln Glu Ala Tyr Arg Ile Tyr
1 5 

(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

Asn Leu Ala Phe Val Ala Gln Ala Ala
1 5 

(2) INFORMATION FOR SEQ ID NO: 140:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:

Phe Val Ala Gln Ala Ala Pro Asp Ser
1 5 

(2) INFORMATION FOR SEQ ID NO: 141:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9388 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 

GCTCGCGGCC GCGAGCTCAA TTAACCCTCA CTAAAGGGAG TCGACTCGAT CAGACTGTTA 60 

CTGTGTCTAT GTAGAAAGAA GTAGACATAA GAGATTCCAT TTTGTTCTGT ACTAAGAAAA 12 0 

ATTCTTCTGC CTTGAGATGC TGTTAATCTG TAACCCTAGC CCCAACCCTG TGCTCACAGA 18 0 

GACATGTGCT GTGTTGACTC AAGGTTCAAT GGATTTAGGG CTATGCTTTG TTAAAAAAGT 24 0 

GCTTGAAGAT AATATGCTTG TTAAAAGTCA TCACCATTCT CTAATCTCAA GTACCCAGGG 3 00 

ACACAATACA CTGCGGAAGG CCGCAGGGAC CTCTGTCTAG GAAAGCCAGG TATTGTCCAA 3 60 

GATTTCTCCC CATGTGATAG CCTGAGATAT GGCCTCATGG GAAGGGTAAG ACCTGACTGT 42 0 

CCCCCAGCCC GACATCCCCC AGCCCGACAT CCCCCAGCCC GACACCCGAA AAGGGTCTGT 4 80 

GCTGAGGAGG ATTAGTAAAA GAGGAAGGCC TCTTTGCAGT TGAGGTAAGA GGAAGGCATC 54 0 

TGTCTCCTGC TCGTCCCTGG GCAATAGAAT GTCTTGGTGT AAAACCCGAT TGTATGTTCT 600 

ACTTACTGAG ATAGGAGAAA ACATCCTTAG GGCTGGAGGT GAGACACGCT GGCGGCAATA 660 

CTGCTCTTTA ATGCACCGAG ATGTTTGTAT AAGTGCACAT CAAGGCACAG CACCTTTCCT 72 0 

TAAACTTATT TATGACACAG AGACCTTTGT TCACGTTTTC CTGCTGACCC TCTCCCCACT 78 0 

ATTACCCTAT TGGCCTGCCA CATCCCCCTC TCCGAGATGG TAGAGATAAT GATCAATAAA 84 0 

TACTGAGGGA ACTCAGAGAC CAGTGTCCCT GTAGGTCCTC CGTGTGCTGA GCGCCGGTCC 90 0 

CTTGGGCTCA CTTTTCTTTC TCTATACTTT GTCTCTGTGT CTCTTTCTTT TCTCAGTCTC 96 0 

TCGTTCCACC TGACGAGAAA TACCCACAGG TGTGGAGGGG CAGGCCACCC CTTCAATAAT 102 0 

TTACTAGCCT GTTCGCTGAC AACAAGACTG GTGGTGCAGA AGGTTGGGTC TTGGTGTTCA 108 0 

CCGGGTGGCA GGCATGGGCC AGGTGGGAGG GTCTCCAGCG CCTGGTGCAA ATCTCCAAGA 114 0 

AAGTGCAGGA AACAGCACCA AGGGTGATTG TAAATTTTGA TTTGGCGCGG CAGGTAGCCA 12 0 0 

TTCCAGCGCA AAAATGCGCA GGAAAGCTTT TGCTGTGCTT GTAGGCAGGT AGGCCCCAAG 12 6 0 

CACTTCTTAT TGGCTAATGT GGAGGGAACC TGCACATCCA TTGGCTGAAA TCTCCGTCTA 13 2 0 

TTTGAGGCTG ACTGAGCGCG TTCCTTTCTT CTGTGTTGCC TGGAAACGGA CTGTCTGCCT 13 8 0 

AGTAACATCT GATCACGTTT CCCATTGGCC GCCGTTTCCG GAAGCCCGCC CTCCCATTTC 144 0 

CGGAAGCCTG GCGCAAGGTT GGTCTGCAGG TGGCCTCCAG GTGCAAAGTG GGAAGTGTGA 150 0 

GTCCTCAGTC TTGGGCTATT CGGCCACGTG CCTGCCGGAC ATGGGACGCT GGAGGGTCAG 15 6 0 

CAGCGTGGAG TCCTGGCCTT TTGCGTCCAC GGGTGGGAAA TTGGCCATTG CCACGGCGGG 162 0 

AACTGGGACT CAGGCTGCCC CCCGGCCGTT TCTCATCCGT CCACCGGACT CGTGGGCGCT 16 8 0 

CGCACTGGCG CTGATGTAGT TTCCTGACCT CTGACCCGTA TTGTCTCCAG ATTAAAGGTA 174 0 

AAAACGGGGC TTTTTCAGCC CACTCGGGTA AAACGCCTTT TGATTTCTAG GCAGGTGTTT 18 00 

TGTTGCACGC CTGGGAGGGA GTGACCCGCA GGTTGAGGTT TATTAAAATA CATTCCTGGT 18 60 

TTATGTTATG TTTATAATAA AGCACCCCAA CCTTTACAAA ATCTCACTTT TTGCCAGTTG 192 0 

TATTATTTAG TGGACTGTCT CTGATAAGGA CAGCCAGTTA T^AATGGAATT TTGTTGTTGC 19 8 0 

TAATTAAACC AATTTTTAGT TTTGGTGTTT GTCCTAATAG CAACAACTTC TCAGGCTTTA 2 04 0 

TAAAACCATA TTTCTTGGGG GAAATTTCTG TGTAAGGCAC AGCGAGTTAG TTTGGAATTG 210 0 

TTTTAAAGGA AGTAAGTTCC TGGTTTTGAT ATCTTAGTAG TGTAATGCCC AACCTGGTTT 216 0 

TTACTAACCC TGTTTTTAGA CTCTCCCTTT CCTTAAATCA CCTAGCCTTG TTTCCACCTG 22 2 0 

AATTGACTCT CCCTTAGCTA AGAGCGCCAG ATGGACTCCA TCTTGGCTCT TTCACTGGCA 22 8 0 

GCCCCTTCCT CAAGGACTTA ACTTGTGCAA GCTGACTCCC AGCACATCCA AGAATGCAAT 2 34 0 

TAACTGTTAA GATACTGTGG CAAGCTATAT CCGCAGTTCC GAGGAATTCA TCCGATTGAT 24 0 0 

TATGCCCAAA AGCCCCGCGT CTATCACCTT GTAATAATCT TAAAGCCCCT GCACCTGGAA 24 60 

CTATTAACTT TCCTGTAACC ATTTATCCTT TTAACTTTTT TGCTTACTTT ATTTCTGTAA 2 52 0 

AATTGTTTTA ACTAGACCTC CCCTCCCCTT TCTAAACCAA AGTATAAAAG AAGATCTAGC 25 80 

CCCTTCTTCA GAGCGGAGAG AATTTTGAGC ATTAGCCATC TCTTGGCGGC CAGCTAAATA 2 64 0 

AATGGACTTT TAATTTGTCT CAAAGTGTGG CGTTTTCTCT AACTCGCTCA GGTACGACAT 2700

TTGGAGGCCC CAGCGAGAAA CGTCACCGGG AGAAACGTCA CCGGGCGAGA GCCGGGCCCG 2760
CTGTGTGCTC CCCCGGAAGG ACAGCCAGCT TGTAGGGGG3 AGTGCCACCT GAAAAAAAAA 2820
TTTCCAGGTC CCCAAAGGGT GACCGTCTTC CGGAGGACAG CGGATCGACT ACCATGC3GG 2880
TGCCCACCAA AATTCCACCT CTGAGTCCTC AACTGCTGAC CCCGGGGTCA GGTAGGTCAG 2940
ATTTGACTTT GGTTCTGGCA GAGGGAAGCG ACCCTGATGA GGGTGTCCCT CTTTTGACTC 3000
TGCCCATTTC TCTAGGATGC TAGAGGGTAG AGCCCTGGTT TTCTGTTAGA CGCCTCTGTG 3060
TCTCTGTCTG GGAGGGAAGT GGCCCTGACA GGGGCCATCC CTTGAGTCAG TCCACATCCC 3120
AGGATGCTGG GGGACTGAGT CCTGGTTTCT GGCAGACTGG TCTCTCTCTC TCTCTTTTTC 3180
TATCTCTAAT CTTTCCTTGT TCAGGTTTCT TGGAGAATCT CTGGGAAAGA AAAAAGAAAA 3240
ACTGTTATAA ACTCTGTGTG AATGGTGAAT GAATGGGGGA GGACAAGGGC TTGCGCTTGT 3300
CCTCCAGTTT GTAGCTCCAC GGCGAAAGCT ACGGAGTTCA AGTGGGCCCT CACCTGCGGT 3360
TCCGTGGCGA CCTCATAAGG CTTAAGGCAG CATCCGGCAT AGCTCGATCC GAGCCGGGGG 3420
TTTATACCGG CCTGTCAATG CTAAGAGGAG CCCAAGTCCC CTAAGGGGGA GCGGCCAGGC 3480
GGGCATCTGA CTGATCCCAT CACGGGACCC CCTCCCCTTG TTTGTCTAAA AAAAAAAAAA 3540
GAAGAAACTG TCATAACTGT TTACATGCCC TAGGGTCAAC TGTTTGTTTT ATGTTTATTG 3600
TTCTGTTCGG TGTCTATTGT CTTGTTTAGT GGTTGTCAAG GTTTTGCATG TCAGGACGTC 3660
GATATTGCCC AAGACGTCTG GGTAAGAACT TCTGCAAGGT CCTTAGTGCT GATTTTTTGT 3720
CACAGGAGGT TAAATTTCTC ATCAATCATT TAGGCTGGCC ACCACAGTCC TGTCTTTTCT 3780
GCCAGAAGCA AGTCAGGTGT TGTTACGGGA ATGAGTGTAA AAAAACATTC GCCTGATTGG 3840

GATTTCTGGC ACCATGATGG TTGTATTTAG ATTGTCATAC CCCACATCCA GGTTGATTGG 3900
ACCTCCTCTA AACTAAACTG GTGGTGGGTT CAAAACAGCC ACCCTGCAGA TTTCCTTGCT 3960
CACCTCTTTG GTCATTCTGT AACTTTTCCT GTGCCCTTAA ATAGCACACT GTGTAGGGAA 4020
ACCTACCCTC GTACTGCTTT ACTTCGTTTA GATTCTTACT CTGTTCCTCT GTGGCTACTC 4080
TCCCATCTTA AAAACGATCC AAGTGGTCCT TTTCCTCCTC CCTGCCCCCT ACCCCACACA 4140
TCTCGTTTTC CAGTGCGACA GCAAGTTCAG CGTCTCCAGG ACTTGGCTCT GCTCTCACTC 4200
CTTGAACCCT TAAAAGAAAA AGCTGGGTTT GAGCTATTTG CCTTTGAGTC ATGGAGACAC 4260
AAAAGGTATT TAGGGTACAG ATCTAGAAGA AGAGAGAGAA CACCTAGATC CAACTGACCC 4320
AGGAGATCTC GGGCTGGCCT CTAGTCCTCC TCCCTCAATC TTAAAGCTAC AGTGATGTGG 4380

CAAGTGGTAT TTAGCTGTTG TGGTTTTTCT GCTCTTTCTG GTCATGTTGA TTCTGTTCTT 4 44 0 

TCGATACTCC AGCCCCCCAG GGAGTGAGTT TCTCTGTCTG TGCTGGGTTT GATATCTATG 4 5 00 

TTCAAATCTT ATTAAATTGC CTTCAAAAAA AAAAAAAAAA GGGAAACACT TCCTCCCAGC 4560 

CTTGTAAGGG TTGGAGCCCT CTCCAGTATA TGCTGCAGAA TTTTTCTCTC GGTTTCTCAG 462 0 

AGGATTATGG AGTCCGCCTT AAAAAAGGCA AGCTCTGGAC ACTCTGCAAA GTAGAATGGC 4 680 

CAAAGTTTGG AGTTGAGTGG CCCCTTGAAG GGTCACTGAA CCTCACAATT GTTCAAGCTG 4 74 0 

TGTGGCGGGT TGTTACTGAA ACTCCCGGCC TCCCTGATCA GTTTCCCTAC ATTGATCAAT 4800 

GGCTGAGTTT GGTCAGGAGC ACCCCTTCCA TGGCTCCACT CATGCACCAT TCATAATTTT 48 60 

ACCTCCAAGG TCCTCCTGAG CCAGACCGTG TTTTCGCCTC GACCCTCAGC CGGTTCAGCT 4 92 0 

CGCCCTGTAC TGCCTCTCTC TGAAGAAGAG GAGAGTCTCC CTCACCCAGT CCCACCGCCT 498 0 

TAAAACCAGC CTACTCCCTT AGGGTCATCC CATGTCTCCT CGGCTATGTC CCCTGTAGGC 504 0 

TCATCACCCA TTGCCTCTTG GTTGCAACCG TGGTGGGAGG AAGTAGCCCC TCTACTACCA 5100 

CTGAGAGAGG CACAAGTCCC TCTGGGTGAT GAGTGCTCCA CCCCCTTCCT GGTTTATGTC 516 0 

CCTTCTTTCT ACTTCTGACT TGTATAATTG GAAAACCCAT AATCCTCCCT TrTCTGAAAA 522 0 

GCCCCAGGCT TTGACCTCAC TGATGGAGTC TGTACTCTGG ACACATTGGC CCACCTGGGA 52 8 0 

TGACTGTCAA CAGCTCCTTT TGACCCTTTT CACCTCTGAA GAGAGGGAAA GTATCCAAAG 534 0 

AGAGGCCAAA AAGTACAACC TCACATCAAC CAATAGGCCG GAGGAGGAAG CTAGAGGAAT 54 00 

AGTGATTAGA GACCCAATTG GGACCTAATT GGGACCCAAA TTTCTCAAGT GGAGGGAGAA 54 60 

CTTTTGACGA TTTCCACCGG TATCTCCTCG TGGGTATTCA GGGAGCTGCT CAGAAACCTA 552 0 

TAAACTTGTC TAAGGCGACT GAAGTCGTCC AGGGGCATGA TGAGTCACCA GGAGTGTTTT 5580

TAGAGCACCT CCAGGAGGCT TATCGGATTT ACACCCCTTT TGACCTGGCA GCCCCCGAAA 564 0 

ATAGCCATGC TCTTAATTTG GCATTTGTGG CTCAGGCAGC CCCAGATAGT AAAAGGAAAC 57 00 

TCCAAAAACT AGAGGGATTT TGCTGGAATG AATACCAGTC AGCTTTTAGA GATAGCCTAA 5760 

AAGGTTTTTG ACAGTCAAGA GGTTGAAAAA CAAAAACAAG CAGCTCAGGC AGCTGAAAAA 582 0 

AGCCACTGAT AAAGCATCCT GGAGTATCAG AGTTTACTGT TAGATCAGCC TCATTTGACT 58 8 0 

TCCCCTCCCA -ATGGTGTTT AAATCCAGCT ACACTACTTC CTGACTCAAA CTCCACTATT 594 0 

CCTGTTCATG ACTGTCAGGA ACTGTTGGAA ACTACTGAAA CTGGCCGACC TGATCTTCAA 6000

AATGTGCCCC TAGGAAAGGT GGATGCCACC GTGTTCACAG ACAGTAGCAG CTTCCTCGAG 6060

AAGGGACTAC GAAAGGCCGG TGCAGCTGTT ACCATGGAGA CAGATGTGTT GTGGGCTCAG 612 0 

GCTTTACCAG CAAACACCTC AGCACT^AAAG GCTGAATTGA TCGCCCTCAC TCAGGCTCTC 618 0 

CGATGGGGTA AGGATATTAA CGTTAACACT GACAGCAGGT ACGCCTTTGC TACTGTGCAT 624 0 

GTACGTGGAG CCATCTACCA GGAGCGTGGG CTACTCACCT CAGCAGGTGG CTGTAATCCA 63 00 

CTGTAAAGGA CATCAAAAGG AAAACACGGC TGTTGCCCGT GGTAACCAGA AAGCTGATTC 63 6 0 

AGCAGCTCAA GATGCAGTGT GACTTTCAGT CACGCCTCTA AACTTGCTGC CCACAGTCTC 64 2 0 

CTTTCCACAG CCAGATCTGC CTGACAATCC CGCATACTCA ACAGAAGAAG AAAACTGGCC 64 8 0 

TCAGAACTCA GAGCCAATAA AAATCAGGAA GGTTGGTGGA TTCTTCCTGA CTCTAGAATC 654 0 

TTCATACCCC GAACTCTTGG GAAAACTTTA ATCAGTCACC TACAGTCTAC CACCCATTTA 660 0 

GGAGGAGCAA AGCTACCTCA GCTCCTCCGG AGCCGTTTTA AGATCCCCCA TCTTCAAAGC 66 6 0 

CTAACAGATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC AGGTAAATGC CAAAAAAGGT 672 0 

CCTAAACCCA GCCCAGGCCA CCGTCTCCAA GAAAACTCAC CAGGAGAAAA GTGGGAAATT 67 8 0 

GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAT ACCTTCTAGT ACTGGTAGAC 6 84 0 

ACCTTCTCTG GATGGACTGA AGCATTTGCT ACCAAAAACG AAACTGTCAJ^ TATGGTAGTT 690 0 

AAGTTTTTAC TCAATGAAAT CATCCCTCGA CGTGGGCTGC CTGTTGCCAT AGGGTCTGAT 696 0 

AATGGACCGG CCTTCGCCTT GTCTATAGTT TAGTCAGTCA GTAAGGCGTT AJ^ACATTCAA 702 0 

TGGAAGCTCC ATTGTGCCTA TCGACCCCAG AGCTCTGGGC AAGTAGAACG CATGAACTGC 708 0 

ACCCTAAAAA ACACTCTTAC AAAATTAATC TTAGAAACCG GTGTAAATTG TGTAAGTCTC 714 0 

CTTCCTTTAG CCCTACTTAG AGTAAGGTGC ACCCCTTACT GGGCTGGGTT CTTACCTTTT 72 0 0 

GAAATCATGT ATGGGAGGGC GCTGCCTATC TTGCCTAAGC TAAGAGATGC CCAATTGGCA 72 6 0 

AAAATATCAC AAACTAATTT ATTACAGTAC CTACAGTCTC CCCAACAGGT ACAAGATATC 73 2 0 

ATCCTGCCAC TTGTTCGAGG AACCCATCCC AATCCAATTC CTGAACAGAC AGGGCCCTGC 73 8 0 

CATTCATTCC CGCCAGGTGA CCTGTTGTTT GTTAAAAAGT TCCAGAGAGA AGGACTCCCT 74 4 0 

CCTGCTTGGA AGAGACCTCA CACCGTCATC ACGATGCCAA CGGCTCTGAA GGTGGATGGC 75 0 0 

ATTCCTGCGT GGATTCATCA CTCCCGCATC AAAAAGGCCA ACGGAGCCCA ACTAGAAACA 756 0 

TGGGTCCCCA GGGCTGGGTC AGGCCCCTTA AAACTGCACC TAAGTTGGGT G/U^GCCATTA 7 62 0 

GATTAATTCT TTTTCTTAAT TTTGTAAAAC AATGCATAGC TTCTGTCAAA CTTATGTATC 7680

TTAAGACTCA ATATAACCCC CTTGTTATAA CTGAGGAATC AATGATTTGA TTCCCCAAAA 7740

ACACAAGTGG GGAATGTAGT GTCCAACCTG GTTTTTACTA ACCCTGTTTT TAGACTCTCC 7800

CTTTCCTTTA ATCACTCAGC CTTGTTTCCA GCTGAATTGA CTCTCCCTTA GCTAAGAGCG 7860

CCAGATGGAC TCCATCTTGG CTCTTTCACT GGCAGCCGCT TCCTCAAGGA CTTAACTTGT 7920

GCAAGCTGAC TCCCAGCACA TCCAAGAATG CAATTAACTG ATAAGATACT GTGGCAAGCT 7980

ATATCCGCAG TTCCCAGGAA TTCGTCCAAT TGATTACACC CAAAAGCCCC GCGTCTATCA 8040

CCTTGTAATA ATCTTAAAGC CCCTGCACCT GGAACTATTA ACGTTCCTGT AACCATTTAT 8100

CCTTTTAACT TTTTTGCCTA CTTTATTTCT GTAAAATTGT TTTAACTAGA CCCCCCCTCT 8160

CCTTTCTAAA CCAAAGTATA AAAGCAAATC TAGCCCCTTC TTCAGGCCGA GAGAATTTCG 8220

AGCGTTAGCC GTCTCTTGGC CACCAGCTAA ATAAACGGAT TCTTCATGTG TCTCAAAGTG 8280

TGGCGTTTTC TCTAACTCGC TCAGGTACGA CCGTGGTAGT ATTTTCCCCA ACGTCTTATT 8340

TTTAGGGCAC GTATGTAGAG TAACTTTTAT GAAAGAAACC AGTTAAGGAG GTTTTGGGAT 8400

TTCCTTTATC AACTGTAATA CTGGTTTTGA TTATTTATTT ATTTATTTAT TTTTTTTGAG 8460

AAGGAGTTTC ACTCTTGTTG CCCAGGCTGG AGTGCAATGG TGCGATCTTG GCTCACTGCA 8520

ACTTCCGCCT CCCAGGTTCA AGCGATTCTC CTGCCTCAGC CTCGAGAGTA GCTGGGATTA 8580

TAGGCATGCG CCACCACACC CAGCTAATTT TGTATTTTTA GTAAAGATGG GGTTTCTTCA 8640

TGTTGGTCAA GCTGGTCTGG AACTCCCCGC CTCGGGTGAT CTGCCCGCCT CGGCCTCCGA 8700

AAGTGCTGGG ATTACAGGTG TGATCCACCA CACCCAGCCG ATTTATATGT ATATAAATCA 8760

CATTCCTCTA ACCAAAATGT AGTGTTTCCT TCCATCTTGA ATATAGGCTG TAGACCCCGT 8820

GGGTATGGGA CATTGTTAAC AGTGAGACCA CAGCAGTTTT TATGTCATCT GACAGCATCT 8880

CCAAATAGCC TTCATGGTTG TCACTGCTTC CCAAGACAAT TCCAAATAAC ACTTCCCAGT 8940

GATGACTTGC TACTTGCTAT TGTTACTTAA TGTGTTAAGG TGGCTGTTAC AGACACTATT 9000

AGTATGTCAG GAATTACACC AAAATTTAGT GGCTCAAACA ATCATTTTAT TATGTATGTG 9060

GATTCTCATG GTCAGGTCAG GATTTCAGAC AGGGCACAAG GGTAGCCCAC TTGTCTCTGT 9120

CTATGATGTC TGGCCTCAGC ACAGGAGACT CAACAGCTGG GGTCTGGGAC CATTTGGAGG 9180

CTTGTTCCCT CACATCTGAT ACCTGGCTTG GGATGTTGGA AGAGGGGGTG AGCTGAGACT 9240

GAGTGCCTAT ATGTAGTGTT TCCATATGGC CTTGACTTCC TTACAGCCTG GCAGCCTCAG 9300
J™"^^^^^ ATTCTTAGGA GGCACAGGGC TCCAGGGCAG ATGCTGAGGG GTCTTTTATG 9360

AGGTAGCACA GCAAATCCAC CCAGGATC 9388



(2) INFORMATION FOR SEQ ID NO: 142:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 419 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:142:



(2) INFORMATION FOR SEQ ID NO:143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:



IZ^^ GCAGTGTGAT GTCCACTGCA GTGTGTTGCT GGGAACAGTT AATGAGCAAA 

TTGTATACAA TGGCTAGTAC ATTGACCGGG ATTTGTTGAA GCTGGTGAGT GtStgS^JJ 

AGCCTGTTAG ACTAGTCTAT GCACATGGCT CTGGTCAACT ACCGCTCTCT St^CcI

GATAAATCCC CCATGCTTTA TATTCTCTTC CAAACATACT ATCCTcScI cSSSgS 

CCTTTGTTAA TGCTTTGTTC TAGACTTTCC CTTTTCTGTT TTCTTATTCA SJSSSc 

TCTTTGCATA GATTGTAAAT TCAAATGCCC TCAGGGTGCA GGCAGT^cS ^SggSaGG 

GAGGCTAGCC AGTGAGATCT GCATCACACT GCTCGACTTA CA GTAAGGGAGG 

(2) INFORMATION FOR SEQ ID NO:144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



9388 



GTCTTTGGAG AGAGCATATC CATCTCCTCC 60

GAGCAGAATT AAACAGGGTA GTCTTAACCA 120

GGTTAAAGAA CACTTTTGGT TTACACTTGT 180

GCAATAAGTT TGTGTGTAAT CAGAAGGAGT 240

TAACTTGGGA GCTGTGTAAT TTAGGCTTTG 300

GAAGTGATTG TGTGTTCGCG TGTGTGTGCG 360

CAAATACCCA ACATCACACT GCTCGACTT 419



60 
120 
180 
240 
300 
360 
402 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144:

I^^n?^'''^'' CCTCCTCAGG CCAAGAAGAT AAAGCTTCAG ACCCCTAACA CATTTCCAAA 60

^ll^a^ ^rir""^ GGGCATCATC CCCGTTCCGA AGGGTCAGGG AGgIgSSJJ 120 

TGAGGTGGAT TCACGAGTTG CGGACAACTC CTTTGATGCC AAGCGAGGTG CAGCCGGAGA 180 
CTGGGGAGAG CGAGCCAATC AGGTTTTGAA GTTCCTCTCA GTGC 224

(2) INFORMATION FOR SEQ ID NO: 145:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 

AGCCATTTAC CACCCATCCA CAAAAAAAAA AAAAAAAAAG AAAAATATCA AGGAATAAAA 60

ATAGACTTTG AACAAAAAGG AACATTTGCT GGCCTGAGGA GGCATCACCC G 111 

(2) INFORMATION FOR SEQ ID NO: 146:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 585 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:



TAGCATGTTG AGCCCAGACA CTTGTAGAGA GAGGAGGACA GTTAGAAGAA GAAGAAAAGT 60

TTTTAAATGC TGAAAGTTAC TATAAGAAAG CTTTGGCTTT GGATGAGACT TTTAAAGATG 120

CAGAGGATGC TTTGCAGAAA CTTCATAAAT ATATGCAGGT GATTCCTTAT TTCCTCCTAG 180

AAATTTAGTG ATATTTGAAA TAATGCCCAA ACTTAATTTT CTCCTGAGGA AAACTATTCT 240

ACATTACTTA AGTAAGGCAT TATGAAAAGT TTCTTTTTAG GTATAGTTTT TCCTAATTGG 300

GTTTGACATT GCTTCATAGT GCCTCTGTTT TTGTCCATAA TCGAAAGTAA AGATAGCTGT 360

GAGAAAACTA TTACCTAAAT TTGGTATGTT GTTTTGAGAA ATGTCCTTAT AGGGAGCTCA 420

CCTGGTGGTT TTTAAATTAT TGTTGCTACT ATAATTGAGC TAATTATAAA AACCTTTTTG 480

AGACATATTT TAAATTGTCT TTTCCTGTAA TACTGATGAT GATGTTTTCT CATGCATTTT 540

CTTCTGAATT GGGACCATTG CTGCTGTGTC TGGGCTCACA TGCTA 585



(2) INFORMATION FOR SEQ ID NO:147:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

TAGCATGTTG AGCCCAGACA CTGGGCAGCG GGGGTGGCCA CGGCAGCTCC TGCCGAGCCC 60

AAGCGTGTTT GTCTGTGAAG GACCCTGACG TCACCTGCCA GGCTAGGGAG GGGTCAATGT 120

GGAGTGAATG TTCACCGACT TTCGCAGGAG TGTGCAGAAG CCAGGTGCAA CTTGGTTTGC 180

TTGTGTTCAT CACCCCTCAA GATATGCACA CTGCTTTCCA AATAAAGCAT CAACTGTCAT 240

CTCCAGATGG GGAAGACTTT TTCTCCAACC AGCAGGCAGG TCCCCATCCA CTCAGACACC 300

AGCACGTCCA CCTTCTCGGG CAGCACCACG TCCTCCACCT TCTGCTGGTA CACGGTGATG 360
J^Sn^^'''''^ AGCCGTTCTG CAKGACCAGC TGCCCCGTGT GCTGTGCCAT CTCACTGGCC

^^^Jn^^^r ^^^^^^^'^^T AGGCCGCGCA TANTGTGCAC AGAAnS^Jg SgA^CCaSS 

CCCACAGCCC ACGTCCAAGA NGACTTTATC CGTCAGGGAT TCTTTATtS Sc2-™? 

CTGTGGTATT AATTGTTCGT GTCTGGGCTC AACATGCTA ^"'^^'^''^^^^ GCAGoATGAC 

(2) INFORMATION FOR SEQ ID NO: 148:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 249 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



420 
480 
540 
579 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:

TGACACCTTG TCCAGCATCT GCAAGCCAGG AAGAGAGTCC TCACCAAGAT CCCC;.CCCCG 
TTGGCACCAG GATCTTGGAC TTCCAATCTC CAGAACTGTG AGAAATA^S^ S^rrS . !n 

= S = — ™i = B 

249 

(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

1°^^^^'^'^^° TCCAGCATCT GCTATTTTGT GACTTTTTAA TAATAGCCAT TCTGACTGGT 
S^^''^^^^ AACTCATTGT GGGTTTGGTC TGCATTTCTC TAATGAtSS IgItAtI^I 
CTTTTTTTAA ATATGCTTGT TGACCACATG TATATCATCT TTTGAgSS? ItcIgII^T 

cllZlZ^^G T^^^f™ c\^SSg^Sg 



60 
120 

CTGGACAAGG TGTCA ^^-^^-xxr ^XTTAATTTC CTTACAGATG 240 

255 

(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:150:

TTACGCTGCA ACACTGTGGA GGCCAAGCTG GGATCACTTC TTCATTPTAZi r^nn^^r^nr^o. 
GGGAAGTTCA AGTCCAGCAG AGGGTGGGTG GGTAGAcIS GGCaSSS S^SSSS llo 
CACAGTGTTG CAGCGTAA

(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151:

TNACGCNGCN ACNNTGTAGA GANGGNAAGG CNTTCCCCAC ATTNCCCCTT CATNANAGAA 60

TTATTCNACC AAGNNTGACC NATGCCNTTT ATGACTTACA TGCNNACTNC NTAATCTGTN 120

TCNNGCCTTA AAAGCNNNTC CACTACATGC NTCANCACTG TNTGTGTNAC NTCATNAACT 180

GTCNGNAATA GGGGCNCATA ACTACAGAAA TGCANTTCAT ACTGCTTCCA NTGCCATCNG 240

CGTGTGGCCT TNCCTACTCT TCTTNTATTC CAAGTAGCAT CTCTGGANTG CTTCCCCACT 300

CTCCACATTG TTGCAGCNAT AAT 323

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 

TCAAGATTCC ATAGGCTGAC CAGTCCAAGG AGAGTTGAAA TCATGAAGGA GAGTCTATCT 60

GGAGAGAGCT GTAGTTTTGA GGGTTGCAAA GACTTAGGAT GGAGTTGGTG GGTGTGGTTA 120

GTCTCTAAGG TTGATTTTGT TCATAAATTT CATGCCCTGA ATGCCTTGCT TGCCTCACCC 180

TGGTCCAAGC CTTAGTGAAC ACCTAAAAGT CTCTGTCTTC TTGCTCTCCA AACTTCTCCT 240

GAGGATTTCC TCAGATTGTC TACATTCAGA TCGAAGCCAG TTGGCAAACA AGATGCAGTC 300

CAGAGGGTCA G 311

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153: 

CAAGATTCCA TAGGCTGACC AGGAGGCTAT TCAAGATCTC TGGCAGTTGA GGAAGTCTCT 60

TTAAGAAAAT AGTTTAAACA ATTTGTTAAA ATTTTTCTGT CTTACTTCAT TTCTGTAGCA 120

GTTGATATCT GGCTGTCCTT TTTATAATGC AGAGTGGGAA CTTTCCCTAC CATGTTTGAT 180

AAATGTTGTC CAGGCTCCAT TGCCAATAAT GTGTTGTCCA AAATGCCTGT TTAGTTTTTA 240

AAGACGGAAC TCCACCCTTT GCTTGGTCTT AAGTATGTAT GGAATGTTAT GATAGGACAT 300

AGTAGTAGCG GTGGTCAGCC TATGGAATCT TG 332
(2) INFORMATION FOR SEQ ID NO: 154:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 345 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:154:

TCAAGATTCC ATAGGCTGAC CTGGACAGAG ATCTCCTGGG TCTGGCCCAG GACAGCAGGC 60

TCAAGCTCAG TGGAGAAGGT TTCCATGACC CTCAGATTCC CCCAAACCTT GGATTGGGTG 120

ACATTGCATC TCCTCAGAGA GGGAGGAGAT GTANGTCTGG GCTTCCACAG GGACCTGGTA 180

TTTTAGGATC AGGGTACCGC TGGCCTGAGG CTTGGATCAT TCANAGCCTG GGGGTGGAAT 240

GGCTGGCAGC CTGTGGCCCC ATTGAAATAG GCTCTGGGGC ACTCCCTCTG TTCCTANTTG 300

AACTTGGGTA AGGAACAGGA ATGTGGTCAN CCTATGGAAT CTTGA 345

(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:155: 

GACGCTTGGC CACTTGACAC ATTAAACAGT TTTGCATAAT CACTANCATG TATTTCTAGT 60

TTGCTGTCTG CTGTGATGCC CTGCCCTGAT TCTCTGGCGT TAATGATGGC AAGCATAATC 120

AAACGCTGTT CTGTTAATTC CAAGTTATAA CTGGCATTGA TTAAAGCATT ATCTTTCACA 180

ACTAAACTGT TCTTCATANA ACAGCCCATA TTATTATCAA ATTAAGAGAC AATGTATTCC 240

AATATCCTTT ANGGCCAATA TATTTNATGT CCCTTAATTA AGAGCTACTG TCCGT 295

(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:

GACGCTTGGC CACTTGACAC TGCAGTGGGA AAACCAGCAT GAGCCGCTGC CCCCAAGGAA 60

CCTCGAAGCC CAGGCAGAGG ACCAGCCATC CCAGCCTGCA GGTAAAGTGT GTCACCTGTC 120

AGGTGGGCTT GGGGTGAGTG GGTGGGGGAA GTGTGTGTGC AAAGGGGGTG TNAATGTNTA 180

TGCGTGTGAG CATGAGTGAT GGCTAGTGTG ACTGCATGTC AGGGAGTGTG AACAAGCGTG 240

CGGGGGTGTG TGTGCAAGTG CGTATGCATA TGAGAATATG TGTCTGTGGA TGAGTGCATT 300

TGAAAGTCTG TGTGTGTGCG TGTGGTCATG ANGGTAANTT ANTGACTGCG CAGGATGTGT 360

GAGTGTGCAT GGAACACTCA NTGTGTGTGT CAAGTGGCCN ANCGTC 406

(2) INFORMATION FOR SEQ ID NO: 157:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157:



TGACGCTTGG CCACTTGACA CACTAAAGGG TGTTACTCAT CACTTTCTTC TCTCCTCGGT 60

GGCATGTGAG TGCATCTATT CACTTGGCAC TCATTTGTTT GGCAGTGACT GTAANCCANA 120

TCTGATGCAT ACACCAGCTT GTAAATTGAA TAAATGTCTC TAATACTATG TGCTCACAAT 180

ANGGTANGGG TGAGGAGAAG GGGAGAGA 208

(2) INFORMATION FOR SEQ ID NO: 158:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 



CTTCAACCTC CTTCAACCTC CTTCAACCTC CTGGATTCAA ACAATCATCC CACCTCAGAC 60

TCCTTAGTAG CTGAGACTAC AGACTCACGC CACTACATCT GGCTAAATTT TTGTAGAGAT 120

AGGGTTTCAT CATGTTGCCC TGGCTGGTCT CAAACTCCTG ACCTCAAGCA ATGTGCCCAC 180

CTCAGCCTCC CAAAGTGCTG GGATTACAGG CATAAGCCAC CATGCCCAGT CCATNTTTAA 240

TCTTTCCTAC CACATTCTTA CCACACTTTC TTTTATGTTT AGATACATAA ATGCTTACCA 300

TTATGATACA ATTGCCCACA GTATTAAGAC AGTAACATGC TGCACAGGTT TGTAGCCTAG 360

GAACAGTAGG CAATACCACA TAGCTTAGGT GTGTGGTAGA CTATACCATC TAGGTTTGTG 420

TAAGTTACAC TTTATGCTGT TTACACAATG ACAAAACCAT CTAATGATGC ATTTCTCAGA 480

ATGTATCCTT GTCAGTAAGC TATGATGTAC AGGGAACACT GCCCAAGGAC ACAGATATTG 540

TACCTGT 547

(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 



GCTCCTCTTG CCTTACCAAC TCACCCAGTA TGTCAGCAAT TTTATCRGCT TTACCTACGA 60

AACAGCCTGT ATCCAAACAC TTAACACACT CACCTGAAAA GTTCAGGCAA CAATCGCCTT 120

CTCATGGGTC TCTCTGCTCC AGTTCTGAAC CTTTCTCTTT TCCTAGAACA TGCATTTARG 180

TCGATAGAAG TTCCTCTCAG TGC 203



(2) INFORMATION FOR SEQ ID NO: 160: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

TGTAAGTCGA GCAGTGTGAT GGGTGGAACA GGGTTGTAAG CAGTAATTGC AAACTGTATT 
TAAACAATAA TAATAATATT TAGCATTTAT AGAGCACTTT ATATCTTCAA AGTACTTGCA 
AACATTAYCT AATTAAATAC CCTCTCTGAT TATAATCTGG ATACAAATGC ACTTAAACTC 
AGGACAGGGT CATGAGARAA GTATGCATTT GAAAGTTGGT GCTAGCTATG CTTTAAAAAC 
CTATACAATG ATGGGRAAGT TAGAGTTCAG ATTCTGTTGG ACTGTTTTAG TGCATTTCAG
TTCAGCCTGA TGGCAGAATT AGATCATATC TGCACTCGAT GACTYTGCTT GATAACTTAT 
CACTGAAATC TGAGTGTTGA TCATCACACT GCTCGACTTA CA 

(2) INFORMATION FOR SEQ ID NO: 161: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

AGCATGTTGA GCCCAGACAC TGACCAGGAG AAAAACCAAC CAATAGAAAC ACGCCCAGAC 
ACTGACCAGG AGAAAAACCA ACCAATAAAA ACAGGCCCGG ACATAAGACA AATAATAAAA
TTAGCGGACA AGGACATGAA AACAGCTATT GTAAGAGCGG ATATAGTGGT GTGTGTCTGG 
GCTCAACATG CTA 

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

TGTTGAGCCC AGACACTGAC CAGGAGAAAA ACCAACCAAT AAAAACAGGC CCGGACATAA 
GACAAATAAT AAAATTAGCG GACAAGGACA TGAAAACAGC TATTGTAAGA GCGGATATAG 
TGGTGTGTGT CTGGGCTCAA CATGCTA 

(2) INFORMATION FOR SEQ ID NO:163: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 



TAGCATGTTG AGCCCAGACA CAAATCTTTC CTTAAGCAAT AAATCATTTC TGCATATGTT 60

TTTAAAACCA CAGCTAAGCC ATGATTATTC AAAAGGACTA TTGTATTGGG TATTTTGATT 120

TGGGTTCTTA TCTCCCTCAC ATTATCTTCA TTTCTATCAT TGACCTCTTA TCCCAGAGAC 180

TCTCAAACTT TTATGTTATA CAAATCACAT TCTGTCTCAA AAAATATCTC ACCCACTTCT 240

CTTCTGTTTC TGCGTGTGTA TGTGTGTGTG TGTGTGTCTG GGCTCAACAT GCTA 294



(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

CGGGATTGGC TTTGAGCTGC AGATGCTGCC TGTGACCGCA CCCGGCGTGG AACAGAAAGC 60

CACCTGGCTG CAAGTGCGCC AGAGCCGCCC TGACTACGTG CTGCTGTGGG GCTGGGGCGT 120

GATGAACTCC ACCGCCCTGA AGGAAGCCCA GGCCACCGGA TACCCCCGCG ACAAGATGTA 180

CGGCGTGTGG TGGGCCGGTG CGGAGCCCGA TGTGCGTGAC GTGGGCGAAG GCGCCAAGGG 240

CTACAACGCG CTGGCTCTGA ACGGCTACGG CACGCAGTCC AAGGTGATCC ANGACATCCT 300

GAAACACGTG CACGACAAGG GCCAGGGCAC GGGGCCCAAA GACGAAGTGG GCTCGGTGCT 360

GTACACCCGC GGCGTGATCA TCCAGATGCT GGACAAGGTG TCAATCACTA AT 412



(2) INFORMATION FOR SEQ ID NO: 165: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 



TTGACACCTT GTCCAGCATC TGCATCTGAT GAGAGCCTCA GATGGCTACC ACTAATGGCA 60

GAAGGCAAAG GAGAACAGGC ATTGTATGGC AAGAAAGGAA GAAAGAGAGA GGGGAGAAAG 120

GTGCTAGGTT CTTTTCAACA ACCAGTTCTT GATGGAACTG AGAGTAAGAG CTCAAGGCCA 180

GGTGTGGTGA CTCCAACCAG TAATCCCAAC ATTTTAGGAG GCTGAGGCAG GCAGATGTCT 240

TGACCCCATG AGTTTGTGAC CAGCCTGAAC AACATCATGA GACTCCATCT CTACAATAAT 300

TACAAAAATT AATCAGGCAT TGTGGTATGC CCTGTAGTCC CAGATGCTGG ACAAGGTGTC 360

A 361

(2) INFORMATION FOR SEQ ID NO: 166: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:166:

TWGACTGACT CATGTCCCCT ACACCCAACT ATCTTCTCCA GGTGGCCAGG CATGATAGAA 60

TCTGATCCTG ACTTAGGGGA ATATTTTCTT TTTACTTCCC ATCTTGATTC CCTGCCGGTG 120

AGTTTCCTGG TTCAGGGTAA GAAAGGAGCT CAGGCCAAAG TAATGAACAA ATCCATCCTC 180

ACAGACGTAC AGAATAAGAG AACWTGGACW TAGCCAGCAG AACMCAAKTG AAAMCAGAAC 240

MCTTAMCTAG GATRACAAMC MCRRARATAR KTGCYCMCMC WTATAATAGA AACCAAACTT 300

GTATCTAATT AAATATTTAT CCACYGTCAG GGCATTAGTG GTTTTGATAA ATACGCTTTG 360

GCTAGGATTC CTGAGGTTAG AATGGAARAA CAATTGCAMC GAGGGTAGGG GACATGAGTC 420

AKTCTAA 427

(2) INFORMATION FOR SEQ ID NO:167: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 500 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167:

AACGTCGCAT GCTCCCGGCC GCCATGGCCG CGGGATAGAC TGACTCATGT CCCCTAAGAT 60

AGAGGAGACA CCTGCTAGGT GTAAGGAGAA GATGGTTAGG TCTACGGAGG CTCCAGGGTG 120

GGAGTAGTTC CCTGCTAAGG GAGGGTAGAC TGTTCAACCT GTTCCTGCTC CGGCCTCCAC 180

TATAGCAGAT GCGAGCAGGA GTAGGAGAGA GGGAGGTAAG AGTCAGAAGC TTATGTTGTT 240

TATGCGGGGA AACGCCRTAT CGGGGGCAGC CRAGTTATTA GGGGACANTR TAGWYARTCW 300

AGNTAGCATC CAAAGCGNGG GAGTTNTCCC ATATGGTTGG ACCTGCAGGC GGCCGCATTA 360

GTGATTAGCA TGTGAGCCCC AGACACGCAT AGCAACAAGG ACCTAAACTC AGATCCTGTG 420

CTGATTACTT AACATGAATT ATTGTATTTA TTTAACAACT TTGAGTTATG AGGCATATTA 480

TTAGGTCCAT ATTACCTGGA 500

(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

TTCATCGCTC GGTGACTCAA GCCTGTAATC CCAGAACTTT GGGAGGCCGA GGGGAGCAGA 60

TCACCTGAGG TTGGGAGTTT GAGACCAGCC TGGCCAACAT GGTGACAACC CGTCTCTGCT 120

AAAAATACAA AAATTAGCCA AGCATGGTGG CATGCACTTG TAATCCCAGC TACTCGGGAG 180

GCTGAGGCAG GAGAATCACT TGAGGCCAGG AGGCAGAGGT TGCAGTGAGG CAGAGGTTGA 240

GATCATGCCA CTGCACTCCA GCCTGGGCAA CAGAGTAAGA CTCCATCTCA AAAAAAAAAA 300

AAAAAAAGAA TGATCAGAGC CACAAATACA GAAAACCTTG AGTCACCGAG CGATGAAA 358

(2) INFORMATION FOR SEQ ID NO: 169:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:



TTCTGTCCAC ACCAATCTTA GAGCTCTGAA AGAATTTGTC TTTAAATATC TTTTAATAGT 60

AACATGTATT TTATGGACCA AATTGACATT TTCGACTATT TTTTCCCAAA AAAAGTCAGG 120

TGAATTTCAG CACACTGAGT TGGGAATTTC TTATCCCAGA AGWCGGCACG AGCAATTTCA 180

TATTTATTTA AGATTGATTC CATACTCCGT TTTCAAGGAG AATCCCTGCA GTCTCCTTAA 240

AGGTAGAACA AATACTTTCT ATTTTTTTTT CACCATTGTG GGATTGGACT TTAAGAGGTG 300

ACTCTAAAAA AACAGAGAAC AAATATGTCT CAGTTGTATT AAGCACGGAC CCATATTATC 360

ATATTCACTT AAAAAAATGA TTTCCTGTGC ACCTTTTGGC AACTTCTCTT TTCAATGTAG 420

GGAAAAACTT AGTCACCCTG AAAACCCACA AAATAAATAA AACTTGTAGA TGTGGGCAGA 480

ARGTTTGGGG GTGGACATTG TATGTGTTTA AATTAAACCC TGTATCACTG AGAAGCTGTT 540

GTATGGGTCA GAGAAAATGA ATGCTTAGAA GCTGTTCACA TCTTCAAGAG CAGAAGCAAA 600

CCACATGTCT CAGCTATATT ATTATTTATT TTTTATGCAT AAAGTGAATC ATTTCTTCTG 660

TATTAATTTC CAAAGGGTTT TACCCTCTAT TTAAATGCTT TGAAAAACAG TGCATTGACA 720

ATGGGTTGAT ATTTTTCTTT AAAAGAAAAA TATAATTATG AAAGCCAAGA TAATCTGAAG 780

CCTGTTTTAT TTTAAAACTT TTTATGTTCT GTGGTTGATG TTGTTTGTTT GTTTGTTTCT 840

ATTTTGTTGG TTTTTTACTT TGTTTTTTGT TTTGTTTTGT TTTGGTTTDG CATACTACAT 900

GCAGTTTCTT TAACCAATGT CTGTTTGGCT AATGTAATTA AAGTTGTTAA TTTATATGAG 960

TGCATTTCAA CTATGTCAAT GGTTTCTTAA TATTTATTGT GTAGAAGTAC TGGTAATTTT 1020

TTTATTTACA ATATGTTTAA AGAGATAACA GTTTGATATG TTTTCATGTG TTTATAGCAG 1080

AAGTTATTTA TTTCTATGGC ATTCCAGCGG ATATTTTGGT GTTTGCGAGG CATGCAGTCA 1140

ATATTTTGTA CAGTTAGTGG ACAGTATTCA GCAACGCCTG ATAGCTTCTT TGGCCTTATG 1200

TTAAATAAAA AGACCTGTTT GGGATGTAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1260

AAAAA 1265

(2) INFORMATION FOR SEQ ID NO: 170:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 



TGTAAGTCGA GCAGTGTGAT GACGATATTC TTCTTATTAA TGTGGTAATT GAACAAATGA 60

TCTGTGATAC TGATCCTGAG CTAGGAGGCG CTGTTCAGTT AATGGGACTT CTTCGTACTC 120

TAATTGATCC AGAGAACATG CTGGCTACAA CTAATAAAAC CGAAAAAAGT GAATTTCTAA 180

ATTTTTTCTA CAACCATTGT ATGCATGTTC TCACAGCACC ACTTTTGACC AATACTTCAG 240

AAGACAAATG TGAAAAGGAT AATATAGTTG GATCAAACAA TAACAACACA ATTTGTCCCG 300

ATAATTATCA AACAGCACAG CTACTTGCCT TAATTTTAGA GTTACTCACA TTTTGTGTGG 360

AACATCACAC TGCTCGACTT ACA 383



(2) INFORMATION FOR SEQ ID NO: 171: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 383 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

TGGGCACCTT CAATATCGCA AGTTAAAAAT AATGTTGAGT TTATTATACT TTTGACCTGT 60

TTAGCTCAAC AGGGTGAAGG CATGTAAAGA ATGTGGACTT CTGAGGAATT TTCTTTTAAA 120

AAGAACATAA TGAAGTAACA TTTTAATTAC TCAAGGACTA CTTTTGGTTG AAGTTTATAA 180

TCTAGATACC TCTACTTTTT GTTTTTGCTG TTCGACAGTT CACAAAGACC TTCAGCAATT 240

TACAGGGTAA AATCGTTGAA GTAGTGGAGG TGAAACTGAA ATTTAAAATT ATTCTGTAAA 300

TACTATAGGG AAAGAGGCTG AGCTTAGAAT CTTTTGGTTG TTCATGTGTT CTGTGCTCTT 360

ATCATCACAC TGCTCGACTT ACA 383

(2) INFORMATION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

TCGGGTGATG CCTCCTCAGG CTTGTCGTTA GTGTACACAG AGCTGCTCAT GAAGCGACAG 60

CGGCTGCCCC TGGCACTTCA GAACCTCTTC CTCTACACTT TTGGTGCGCT TCTGAATCTA 120

GGTCTGCATG CTGGCGGCGG CTCTGGCCCA GGCCTCCTGG AAAGTTTCTC AGGATGGGCA 180

GCACTCGTGG TGCTGAGCCA GGCACTAAAT GGACTGCTCA TGTCTGCTGT CATGGAGCAT 240

GGCAGCAGCA TCACACGCCT CTTTGTGGTG TCCTGCTCGC TGGTGGTCAA CGCCGTGCTC 300

TCAGCAGTCC TGCTACGGCT GCAGCTCACA GCCGCCTTCT TCCTGGCCAC ATTGCTCATT 360

GGCCTGGCCA TGCGCCTGTA CTATGGCAGC CGCTAGTCCC TGACAACTTC CACCCTGATT 420

CCGGACCCTG TAGATTGGGC GCCACCACCA GATCCCCCTC CCAGGCCTTC CTCCCTCTCC 480

CATCAGCGGC CCTGTAACAA GTGCCTTGTG AGAAAAGCTG GAGAAGTGAG GGCAGCCAGG 540

TTATTCTCTG GAGGTTGGTG GATGAAGGGG TACCCCTAGG AGATGTGAAG TGTGGGTTTG 600

GTTAAGGAAA TGCTTACCAT CCCCCACCCC CAACCAAGTT NTTCCAGACT AAAGAATTAA 660

GGTAACATCA ATACCTAGGC CTGAGGAGGC ATCACCCGA 699

(2) INFORMATION FOR SEQ ID NO:173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 701 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

TCGGGTGATG CCTCCTCAGG CCAGATCAAA CTTGGGGTTG AAAACTGTGC AAAGAAATCA 60

ATGTCGGAGA AAGAATTTTG CAAAAGAAAA ATGCCTAATC AGTACTAATT TAATAGGTCA 120

CATTAGCAGT GGAAGAAGAA ATGTTGATAT TTTATGTCAG CTATTTTATA ATCACCAGAG 180

TGCTTAGCTT CATGTAAGCC ATCTCGTATT CATTAGAAAT AAGAACAATT TTATTCGTCG 240

GAAAGAACTT TTCAATTTAT AGCATCTTAA TTGCTCAGGA TTTTAAATTT TGATAAAGAA 300

AGCTCCACTT TTGGCAGGAG TAGGGGGCAG GGAGAGAGGA GGCTCCATCC ACAAGGACAG 360

AGACACCAGG GCCAGTAGGG TAGCTGGTGG CTGGATCAGT CACAACGGAC TGACTTATGC 420

CATGAGAAGA AACAACCTCC AAATCTCAGT TGCTTAATAC AACACAAGCT CATTTCTTGC 480

TCACGTTACA TGTCCTATGT AGATCAACAG CAGGTGACTC AGGGACCCAG GCTCCATCTC 540

CATATGAGCT TCCATAGTCA CCAGGACACG GGCTCTGAAA GTGTCCTCCA TGCAGGGACA 600

CATGCCTCTT CCTTTCATTG GGCAGAGCAA GTCACTTATG GCCAGAAGTC ACACTGCAGG 660

GCAGTGCCAT CCTGCTGTAT GCCTGAGGAG GCATCACCCG A 701

(2) INFORMATION FOR SEQ ID NO: 174: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 



TCGGGTGATG CCTCCTCANG CCCCTAAATC AGAGTCCAGG GTCAGAGCCA CAGGAGACAG 60

GGAAAGACAT AGATTTTAAC CGGCCCCCTT CAGGAGATTC TGAGGCTCAG TTCACTTTGT 120

TGCAGTTTGA ACAGAGGCAG CAAGGCTAGT GGTTAGGGGC ACGGTCTCTA AAGCTGCACT 180

GCCTGGATCT GCCTCCCAGC TCTGCCAGGA ACCAGCTGCG TGGCCTTGAG CTGCTGACAC 240

GCAGAAAGCC CCCTGTGGAC CCAGTCTCCT CGTCTGTAAG ATGAGGACAG GACTCTAGGA 300

ACCCTTTCCC TTGGTTTGGC CTCACTTTCA CAGGCTCCCA TCTTGAACTC TATCTACTCT 360

TTTCCTGAAA CCTTGTAAAA GAAAAAAGTG CTAGCCTGGG CAACATGGCA AAACCCTGTC 420

TCTACAAAAA ATACAAAAAT TAGTTGGGTG TGGTGGCATG TGCCTGTAGT CCCAGCCACT 480

TGGGAGGTGC TGAGGTGGGA GGATCACTTG AGCCCGGGAG GTGGAGGTTG CAGTGAGCCA 540

AGATCATGCC ACTGCACTCC AGCCTGAGTA ATAGAGTAAG ACTCTGTCTC AAAAACAACA 600

ACAACAACAG TGAGTGTGCC TCTGTTTCCG GGTTGGATGG GGCACCACAT TTATGCATCT 660

CTCAGATTTG GACGCTGCAG CCTGAGGAGG CATCACCCGA 700



(2) INFORMATION FOR SEQ ID NO:175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

TATAGGGCGA ATTGGGCCCG AGTTGCATGN TCCCGGCCGC CATGGCCGCG GGATTCGGGT 60

GATGCCTCCT CAGGCTTGTC TGCCACAAGC TACTTCTCTG AGCTCAGAAA GTGCCCCTTG 120

ATGAGGGAAA ATGTCCTACT GCACTGCGAA TTTCTCAGTT CCATTTTACC TCCCAGTCCT 180

CCTTCTAAAC CAGTTAATAA ATTCATTCCA CAAGTATTTA CTGATTACCT GCTTGTGCCA 240

GGGACTATTC TCAGGCTGAA GAAGGTGGGA GGGGAGGGCG GAACCTGAGG AGCCACCTGA 300

GCCAGCTTTA TATTTCAACC ATGGCTGGCC CATCTGAGAG CATCTCCCCA CTCTCGCCAA 360

CCTATCGGGG CATAGCCCAG GGATGCCCCC AGGCGGCCCA GGTTAGATGC GTCCCTTTGG 420

CTTGTCAGTG ATGACATACA CCTTAGCTGC TTAGCTGGTG CTGGCCTGAG GAGGCATCAC 480

CCGA 484
(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 432 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:
TCGGGTGATG CCTCCTCAGG GCTCAAGGGA TGAGAAGTGA CTTCTTTCTG CArrrnnnn^ 

^ ~ = iiii EEi E= -° 

5rj??s^c^ ™- 

432 

(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 788 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177:

^ ™s: sss ss= = 
=ss :ssss ;es= sss s=s 



60 
120 
180 
240 
300 
360 
420 

ACATGCTA '^^""^^^^^^ GTGATATGAG ACAGACAGTT GCGGTGGGTG TCTGGGCTCA 



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



780 
788 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178:

TAGCATGTTG AGCCCAGACA CCTGTGTTTC TGGGAGCTCT GGCAGTGGCG GATTCATAGG 60

CACTTGGGCT GCACTTTGAA TGACACACTT GGCTTTATTA GATTCACTAG TTTTTAAAAA 120

ATTGTTGTTC GTTTCTTTTC ATTAAAGGTT TAATCAGACA GATCAGACAG CATAATTTTG 180

TATTTAATGA CAGAAACGTT GGTACATTTC TTCATGAATG AGCTTGCATT CTGAAGCAAG 240

AGCCTACAAA AGGCACTTGT TATAAATGAA AGTTCTGGCT CTAGAGGCCA GTACTCTGGA 300

GTTTCAGAGC AGCCAGTGAT TGTTCCAGTC AGTGATGCCT AGTTATATAG AGGAGGAGTA 360

CACTGTGCAC TCTTCTAGGT GTAAGGGTAT GCAACTTTGG ATCTTAAAAT TCTGTACACA 420

TACACACTTT ATATATATGT ATGTATGTAT GAAAACATGA AATTAGTTTG TCAAATATGT 480

GTGTGTTTAG TATTTTAGCT TAGTGCAACT ATTTCCACAT TATTTATTAA ATTGATCTAA 540

GACACTTTCT TGTTGACACC TTGAATATTA ATGTTCAAGG GTGCAATGTG TATTCCTTTA 600

GATTGTTAAA GCTTAATTAC TATGATTTGT AGTAAATTAA CTTTTAAAAT GTATTTGAGC 660

CCTTCTGTAG TGTCGTAGGG CTCTTACAGG GTGGGAAAGA TTTTAATTTT CCAGTTGCTA 720

ATTGAACAGT ATGGCCTCAT TATATATTTT GATTTATAGG AGTTTGTGTC TGGGCTCAAC 780

ATGCTA 786

(2) INFORMATION FOR SEQ ID NO: 179:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:



TAGCATGTTG AGCCCAGACA CTGGTTACAA GACCAGACCT GCTTCCTCCA TATGTAAACA 60

GCTTTTAAAA AGCCAGTGAA CCTTTTTAAT ACTTTGGCAA CCTTCTTTCA CAGGCAAAGA 120

ACACCCCCAT CCGCCCCTTG TTTGGAGTGC AGAGTTTGGC TTTGGTTCTT TGCCTTGCCT 180

GGAGTATACT TCTAATTCCT GTTGTCCTGC ACAAGCTGAA TACCGAGCTA CCCACCGCCA 240

CCCAGGCCAG GTTTCCACTC ATTTATTACT TTATGTTTCT GTTCCATTGC TGGTCCACAG 300

AAATAAGTTT TCCTTTGGAG GAATGTGATT ATACCCCTTT AATTTCCTCC TTTTGCTTTT 360

TTTTAATATC ATTGGTATGT GTTTGGCCCA GAGGAAACTG AAATTCACCA TCATCTTGAC 420

TGGCAATCCC ATTACCATGC TTTTTTTAAA AAACGTAATT TTTCTTGCCT TACATTGGCA 480

GAGTAGCCCT TCCTGGCTAC TGGCTTAATG TAGTCACTCA GTTTCTAGGT GGCATTAGGC 540

ATGAGACCTG AAGCACAGAC TGTCTTACCA CTAAAGGTGA CAAGATCTCA AACCTTAGCC 600

AAAGGGCTAT GTCAGGTTTC AATGCTATCT GCTTCTGTTC CTGCTCACTG TTCTGGATTT 660

TGTCCTTCTT CATCCCTAGC ACCAGAATTT CCCAGTCTCC CTCCCTACCT TCCCTTGTTT 720

TAATTCTAAT CTATCAGCAA AATAACTTTT CAAATGTTTT AACCGGTATC TCCATGTGTC 780

TGGGCTCAAC ATGCTA 796

(2) INFORMATION FOR SEQ ID NO: 180:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 488 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
GGATGTGCTG CAAGGCGATT AAGTTGGGTA ACGCCAGGGT TTTCCCAGTC ACGACGTTGT 60

AAAACGACGG CCAGTGAATT GTAATACGAC TCACTATAGG GCGAATTGGG CCCGACGTCG 120

CATGCTCCCG GCCGCCATGG CCGCGGGATA GCATGTTGAG CCCAGACACC TGCAGGTCAT 180

TTGGAGAGAT TTTTCACGTT ACCAGCTTGA TGGTCTTTTT CAGGAGGAGA GACACTGAGC 240

ACTCCCAAGG TGAGGTTGAA GATTTCCTCT AGATAGCCGG ATAAGAAGAC TAGGAGGGAT 300

GCCTAGAAAA TGATTAGCAT GCAAATTTCT ACCTGCCATT TCAGAACTGT GTGTCAGCCC 360

ACATTCAGCT GCTTCTTGTG AACTGAAAAG AGAGAGGTAT TGAGACTTTT CTGATGGCCG 420

CTCTAACATT GTAACACAGT AATCTGTGTG TGTGTGGGTG TGTGTGTGTG TCTGGGCTCA 480

ACATGCTA 488

(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 317 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

TAGCATGTTG AGCCCAGACA CGGCGACGGT ACCTGATGAG TGGGGTGATG GCACCTGTGA 60

AAAGGAGGAA CGTCATCCCC CATGATATTG GGGACCCAGA TGATGAACCA TGGCTCCGCG 120

TCAATGCATA TTTAATCCAT GATACTGCTG ATTGGAAGGA CCTGAACCTG AAGTTTGTGC 180

TGCAGGTTTA TCGGGACTAT TACCTCACGG GTGATCAAAA CTTCCTGAAG GACATGTGGC 240

CTGTGTGTCT AGTAAGGGAT GCACATGCAG TGGCCAGTGT GCCAGGGGTA TGGTTCGTGT 300

CTGGGCTCAA CATGCTA 317

(2) INFORMATION FOR SEQ ID NO:182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

TAGCATGTTG AGCCCAGACA CTGGCTGTTA GCCAAATCCT CTCTCAGCTG CTCCCTGTGG 60

TTTGGTGACT CAGGATTACA GAGGCATCCT GTTTCAGGGA ACAAAAAGAT TTTAGCTGCC 120

AGCAGAGAGC ACCACATACA TTAGAATGGT AAGGACTGCC ACCTCCTTCA AGAACAGGAG 180

TGAGGGTGGT GGTGAATGGG AATGGAAGCC TGCATTCCCT GATGCATTTG TGCTCTCTCA 240

AATCCTGTCT TAGTCTTAGG AAAGGAAGTA AAGTTTCAAG GACGGTTCCG AACTGCTTTT 300

TGTGTCTGGG CTCAACATGC TATCCCGCGG CCATGGCGGC CGGGAGCATG CGACGTCGGG 360

CCCAATTCGC CCTATAGTGA GTCGTATTAC AATTCACTGG CCGTCGTTTT ACAACGTCGT 420

GACTGGGAAA ACCCTGGCGT TACCCAACTT AATCGCCTTG CAGCACATCC CCCTTTCCCA 480

GCTGGCGTAA TANCGAAAAG GCCCGCA 507

(2) INFORMATION FOR SEQ ID NO:183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183: 

GATTTACGCT GCAACACTGT GGAGGTAGCC CTGGAGCAAG GCAGGCATGG ATGCTTCTGC 60

AATCCCCAAA TGGAGCCTGG TATTTCAGCC AGGAATCTGA GCAGAGCCCC CTCTAATTGT 120

AGCAATGATA AGTTATTCTC TTTGTTCTTC AACCTTCCAA TAGCCTTGAG CTTCCAGGGG 180

AGTGTCGTTA ATCATTACAG CCTGGTCTCC ACAGTGTTGC AGCGTAA 227

(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:

TTACGCTGCA ACACTGTGGA GCAGATTAAC ATCAGACTTT TCTATCAACA TGACTGGGGT 60

TACTAAAAAG ACAACAAATC AATGGCTTCA AAAGTCTAAG GAATAATTTC GATACTTCAA 120

CTTTATAAAA CCTGACAAAA CTATCAATCA AGCATAAAGA CAGATGAAGA ACATTTCCAG 180

ATTTTGGCCA ATCAGATATT TTACCTCCAC AGTGTTGCAG CGTAA 225

(2) INFORMATION FOR SEQ ID NO:185: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

GGCCCGACGT CGCATGCTCC CGGCCGCCAT GGCCGCGGGA TTCGTTAGGG TCTCTATCCA 60

CTGGGACCCA TAGGCTAGTC AGAGTATTTA GAGTTGAGTT CCTTTCTGCT TCCCAGAATT 120

TGAAAGAAAA GGAGTGAGGT GATAGAGCTG AGAGATCAGA TTTGCCTCTG AAGCCTGTTC 180

AAGATGTATG TGCTCAGACC CCACCACTGG GGCCTGTGGG TGAGGTCCTG GGCATCTATT 240

TGAATGAATT GCTGAAGGGG AGCACTATGC CAAGGAAGGG GAACCCATCC TGGCACTGGC 300

ACAGGGGTCA CCTTATCCAG TGCTCAGTGC TTCTTTGCTG CTACCTGGTT TTCTCTCATA 360

TGTGAGGGGC AGGTAAGAAG AAGTGCCCRG TGTTGTGCGA GTTTTAGAAC ATCTACCAGT 420

AAGTGGGGAA GTTTCACAAA GCAGCAGCTT TGTTTTGTGT ATTTTCACCT TCAGTTAGAA 480

GAGGAAGGCT GTGAGATGAA TGTTAGTTGA GTGGAAAAGA CGGGTAAGCT TAGTGGATAG 540

AGACCCTAAC GAATCACTAG TGCGGCCGCC TTGCAGGTCG ACCATATGGG AGAGCTC 597

(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:

GGCCCGAAGT TGCATGTTCC CGGCCGCCAT GGCCGCGGGA TTCGTTAGGG TCTCTATCCA 60

CTACCTAAAA AATCCCAAAC ATATAACTGA ACTCCTCACA CCCAATTGGA CCAATCCATC 120

ACCCCAGAGG CCTACAGATC CTCCTTTGAT ACATAAGAAA ATTTCCCCAA ACTACCTAAC 180

TATATCATTT TGCAAGATTT GTTTTACCAA ATTTTGATGG CCTTTCTGAG CTTGTCAGTG 240

TGAACCACTA TTACGAACGA TCGGATATTA ACTGCCCCTC ACCGTCCAGG TGTAGCTGGC 300

AACATCAAGT GCAGTAAATA TTCATTAAGT TTTCACCTAC TAAGGTGCTT AAACACCCTA 360

GGGTGCCATG TCGGTAGCAG ATCTTTTGAT TTGTTTTTAT TTCCCATAAG GGTCCTGTTC 420

AAGGTCAATC ATACATGTAG TGTGAGCAGC TAGTCACTAT CGCATGACTT GGAGGGTGAT 480

AATAGAGGCC TCCTTTGCTG TTAAAGAACT CTTGTCCCAG CCTGTCAAAG TGGATAGAGA 540

CCCTAACGAA TCACTAGTGC GGCCGCCTGC AGGTCGACCA TATGGGAGAG CTCCCAA 597

(2) INFORMATION FOR SEQ ID NO: 187: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 324 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 



TCGTTAGGGT CTCTATCCAC TTGCAGGTAA AATCCAATCC TGTGTATATC TTATAGTCTT 60

CCATATGTAG TGGTTCAAGA GACTGCAGTT CCAGAAAGAC TAGCCGAGCC CATCCATGTC 120

TTCCACTTAA CCCTGCTTTG GGTTACACAT CTTAACTTTT CTGTTCAAGT TTCTCTGTGT 180

AGTTTATAGC ATGAGTATTG GGAWAATGCC CTGAAACCTG ACATGAGATC TGGGAAACAC 240

AAACTTACTC AATAAGAATT TCTCCCATAT TTTTATGATG GAAAAATTTC ACATGCACAG 300

AGGAGTGGAT AGAGACCCTA ACGA 324

(2) INFORMATION FOR SEQ ID NO: 188:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

GCGCGGGGAT TCGGGGTGAT ACCTCCTCAT GCCAAAATAC AACGTNTAAT TTCACAACTT 60

GCCTTCCAAT TTACGCATTT TCAATTTGCT CTCCCCATTT GTTGAGTCAC AACAAACACC 120

ATTGCCCAGA AACATGTATT ACCTAACATG CACATACTCT TAAAACTACT CATCCCTT 178

(2) INFORMATION FOR SEQ ID NO: 189: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 367 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:

TGACACCTTG TCCAGCATCT GACACAGTCT TGGCTCTTGG AAAATATTGG ATAAATGAAA 60

ATGAATTTCT TTAGCAAGTG GTATAAGCTG AGAATATACG TATCACATAT CCTCATTCTA 120

AGACACATTC AGTGTCCCTG AAATTAGAAT AGGACTTACA ATAAGTGTGT TCACTTTCTC 180

AATAGCTGTT ATTCAATTGA TGGTAGGCCT TAAAAGTCAA AGAAATGAGA GGGCATGTGA 240

AAAAAAGCTC AACATCACTG ATCATTAGAA AACTTCCATT CAAACCCCCA ATGAGATACC 300

ATCTCATACC AGTCAGAATG GCTATTATTA AAAAGTCAAA AAATAACAGA TGCTGGACAA 360

GGTGTCA 367

(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

GACACCTTGT CCAGCATCTG ACAACGCTAA CAGCCTGAGG AGATCTTTAT TTATTTATTT 60

AGTTTTTACT CTGGCTAGGC AGATGGTGGC TAAAACATTC ATTTACCCAT TTATTCATTT 120

AATTGTTCCT GCAAGGCCTA TGGATAGAGT ATTGTCCAGC ACTGCTCTGG AAGCTAGGAG 180

CATGGGGATG AACAAGATAG GCTACATCCT GTTCCCACAG AACTTCCACT TTAGTCTGGG 240

AAACAGATGA TATATACAAA TATATAAATG AATTCAGGTA GTTTTAAGTA CGAAAAGAAT 300

AAGAAAGCAG AGTCATGATT TANAATGCTG GAAACAGGGG CTATTGCTTG AGATATTGAA 360

GGTGCCCAA 369

(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TGACACCTTG TCCAGCATCT GCACAGGGAA AAGAAACTAT TATCAGAGTG AACAGGCAAC 60

CTACAGAATG GGAGAAAATT TTTGCAATCT ATCCATCTGA CAAAGGGCTA ATATCCAGAA 120

TCTACAAAGA ACTTATACAA ATTTACAAGA AACAAACAAA CAAACAACTC CTCAAAAAGT 180

GGGTGAAGGA TGTGAACAGA CACTTCTCAA AAGAAGACAT TTATGGGGCC AACAAACATA 240

TGAAAAAAAG CTCATCATCA CTGGTCACTA GATAAATGCA AATCAAAACC ACAATGAGAT 300

ACCATCTCAT TCCAGTTAGA ATGGCAATCA TTAAAAAGTC AGGAAACAAC AGATGCTGGA 360

CAAGGTGTC 369

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192:

TGACGCTTGG CCACTTGACA CTTCATCTTT GCACAGAAAA ACTTCTTTAC AGATTTAATT 60

CAAGACTGGT CTAGTGACAG TCCTCCAGAC ATTTTTTCAT TTGTTCCATA JaSctg^J 120

TTTAAAATCA TGTTTCATCA GTTTGAAATG ATTTGGGCTG CTAA^S^CA cS^SSg^C 180

SSJ^JIf CTAAACAACA GGAAAATGTG TATCTGGCAG CCTGtS^gJ SJIS^C 240

ATTGATTTTT CTTTGCCTTT TACGGACTTT GTTCCAGCTA CATGTAATAC CAACT^m 300

S™?tJa JJSgSIS LT'— tttctaccag actgc^JS? 360

CC™G? G™S5? S^GcScT TGATTCAXGA TACTGGTATT 420

449

(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

TGACGCTTGG CCACTTGACA CCAGGGATGT AKCAGTTGAA TATAATCCTG C;^ ^ITTGTACA 60

Ir^rrT'' ^^^^"^^^^^ ACATTCTAGA AAGAGACAAC CAGGAtSS SgccA^ 120

AGCTGCAATA AATAACTGGT AATTGCAGTA ATCATTTCAG GCCAATTCAA TCCAgS^SS 180

ctcagaggtg cctttggctg agagaagagg tgagatataa tgtgttSct ?ScJ?™? 240

tggaagaata actccacaat agtctgagga ctagatacaa acctatttSc cat?^gS 300

GG^?^^GCG? jr'"''"' TGTTGGAGAT TAGACTCCAG SSJ^S 360

372

(2) INFORMATION FOR SEQ ID NO: 194:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 309 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194:

TGACGCTTGG CCACTTGACA CTTATGTAGA ATCCATCGTG GGCTGATGCA AGCCCTTTAT 60

TTAGGCTTAG TGTTGTGGGC ACCTTCAATA TCACACTAGA GACAAACGCC AcSSJSg 120

CAGAAACATT CAGTTCTGAN CACTCGAATG GCAGGATAAC TTTTTG^gS S^J^TcSc 180

i^JIJf AAACAAACTC TGCANTCTCA CGTTACAAAA AAACGTAcS SS^ISIS 240

JSSc^cf "^^^^^^^ CCATCTATAA CAAAGTAACT TACAACTAGT G^cSSJ^S 300

309

(2) INFORMATION FOR SEQ ID NO: 195: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 312 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

TGACGCTTGG CCACTTGACA CCCAATCTCG CACTTCATCC TCCCAGCACC TGATGAAGTA 60

GGACTGCAAC TATCCCCACT TCCCAGATGA GGGGACCAAN GTACACATTA GGACCCGGAT 120

GGGAGCACAG ATTTGTCCGA TCCCAGACTC CAAGCACTCA GCGTCACTCC AGGACAGCGG 180

CTTTCAGATA AGGTCACAAA CATGAATGGC TCCGAC7VACC GGAGTCAGTC CGTGCTGAGT 240

TAAGGCAATG GTGACACGGA TGCACGTGTN ACCTGT7UVTG GTTCATCGTA AGTGTCAAGT 300

GGCCAAGCGT CA 312

(2) INFORMATION FOR SEQ ID NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

TGTATCGACG TAGTGGTCTC CTCAGCCATG CAGAACTGTG ACTCAATTAA ACCTCTTTCC 60

TTTATGAATT ACCCAATCTC GGGTAGTGTC TTTATAGTAG TGTGAGAATG GACTAATACA 120

AGTACATTTT ACTTAGTAAT AATAATAAAC AAATATATTA CATTTTTGTG TATTTACTAC 180

ACCATATTTT TTATTGTTAT TGTAGTGTAC ACCTTCTACT TATTAAAAGA AATAGGCCCG 240

AGGCGGGCAG ATCACGAGGT CAGGAGATGG AGACCACTAC GTCGATAC 288

(2) INFORMATION FOR SEQ ID NO: 197:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

TTGGGCACCT TCAATATCAT GACAGGTGAT GTGATTU^CCA AGAAGGCTAC TAAGTGATTA 60

ATGGGTGGGT AATGTATACA GAGTAGGTAC ACTGGACAGA GGGGTAATTC ATAGCCAAGG 120

CAGGAGAAGC AGAATGGCAA AACATTTCAT CACACTACTC AGGATAGCAT GCAGTTTAAA 180

ACCTATAAGT AGTTTATTTT TGGAATTTTC CACTTAATAT TTTCAGACTG CAGGTAACTA 240

AACTGTGGAA CACAAGAACA TAGATAAGGG GAGACCACTA CGTCGATAC 289

(2) INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 






(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:

^rJ^^S^r AGTGGTCTCC CAAGCAGTGG GAAGAAAACG TGAACCAATT AAAATGTATC 60

JSIJ^n^S; ^^^^^^G CTTGAGTAAA GATTCCAAGT GGGTCACaI? SJJJgaSS 120

^Inni^'' CTGTCAAAGA GATTTGCTAT GAGGTTGCTC TCAATGACTT cSgSSot 180

cggcaggaga ttgaagccct ggccattgtc aagatgaagg agctttgtgc ca?gSSgc 240

aagaaagacc ccaatgagcg ggactcctgg agaccactac gtcgatac 288

(2) INFORMATION FOR SEQ ID NO: 199:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1027 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:

^^Mn^^I'"'''' AAAAACNCAA NTGGGGGAAA GGGGGNTTNN TNGCAAGGGG ATAAAGGGGG 
AANCCCAGGG TTTCCCCATT CAGGGAGGTG TAAAAAGNCG GCCAGGGgS Jg^AGgJ l.n 

— - ™- ~- ssss sEE 

SSS^S? ?~ ™i s~ 

sss is= ~- ss= - Ei 
=s s?—? e= ss— — ™ ~ 

TTTATATCCT TCCTGATTGA GAGTTACCAT AACCTGATCC ACAATGCCcS ^SS™ ^^n 
l^n.^"^ GTGCTACAGT CTCTCTTGGT ATAGCGTCTA TTGGTgSS cSJ^TCA^C 

TTCATTTTTC AGGCAAGGTG AACTGTTTTG CCTATAATAA CMTCATCTCC T^HmcH 780 
AACCCCKGGA RCTATCAAAC CATCATCATC CAGCGTTCKT watgtymcta StcJcSS 

^5°;^^°^^^ GCAGGTCAAG ATATNGGAAA ACCGCGCACC CGTTNGGAgS ^I^S^gII Inn 
TTTTCCATAT GTCCCNTAAA TTANCTNGNC TTANCCTGGC CNTAACC™5 tSSSSJJ 

JcS™'' ^^^^^^^^^^ CCNGCTTN.A ACCGGAAACG TTA^™ I^CNgSg^^ .O.'S 

1027 

(2) INFORMATION FOR SEQ ID NO: 200:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:200: 

JJn^JS™ CGACGCTGGC CATCTTGAAT CCTAGGGCAT GAAGTTGCCC CAAAGTTCAG 60

cJS™^'' ^^^^^^^^^^ CTCTGGTTTA TCACAAAGAA TAGGATGgSa T^tJ^ot 120

GGACACTTAA ATAAGCTATA AATTATATGG TCCTTGTCTA GCAGGAGaS l^C^H 180

TATACTACCA GCGTCGTAAT GTCACTA 207

(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201:



TGGGCACCTT CAATATCTAT TAAAAGCACA AATACTGAAG AACACACCAA GACTATCAAT 60

GAGGTTACAT CTGGAGTCCT CGATATATCA GGAAAAAATG AAGTGAACAT TCACAGAGTT 120

TTACTTCTTT GGGAACTCAA ATGCTAGAAA AGAAAAGGGT GCCCTCTTTC TCTGGCTTCC 180

TGGTCCTATC CAGCGTCGTA ATGTCACTA 209



(2) INFORMATION FOR SEQ ID NO:202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:202: 



NTACGCTGCA ACACTGTGGA GCCACTGGTT TTTATTCCCG GCAGGTTATC CAGCAAACAG 60

TCACTGAACA CACCGAAGAC CGTGGTATGG TAACCGTTCA CAGTAATCGT TCCAGTCGTC 120

TGCGGGACCC CGACGAGCGT CACTGGGTAC AGACCAGATT CAGCCGGAAG AGAAAGCGCC 180

GCAGGGAGAG ACTCGAACTC CACTCCGCTG GTGAGCAGCC CCATGTTTTC AACTCGAAGT 240

TCAAACGGCA TTGGGTTATA TACCATCAGC TGAACTTCAC ACACATCTCC TTGAACCCAC 300

TGGAAATCTA TTTTCTTGTT CCGCTCTTCT CCACAGTGTT GCAGCGTAA 349



(2) INFORMATION FOR SEQ ID NO:203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:



TGCTCCTCTT GCCTTACCAA CCCAAAGCCC ACTGTGAAAT ATGAAGTGAA TGACAAAATT 60

CAGTTTTCAA CGCAATATAG TATAGTTTAT CTGATTCTTT TGATCTCCAG GACACTTTAA 120

ACAACTGCTA CCACCACCAC CAACCTAGGG ATTTAGGATT CTCCACAGAC CAGAAATTAT 180

TTCTCCTTTG AGTTTCAGGC TCCTCTGGGA CTCCTGTTCA TCAATGGGTG GTAAATGGCT 240

A 241



(2) INFORMATION FOR SEQ ID NO: 204:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 248 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204:

TAGCCATTTA CCACCCATCT GCAAACCSWG ACMWWCARGR CYWGWACKYA GGCGATTTGA 

AGTACTGGTA ATGCTCTGAT CATGTTAGTT ACATAAGTGT GGTCAGTTTA CAAAAATTCA 

CAGAACTAAA TACTCAATGC TATGTGTTCA TGTCTGTGTT TATGTGTGTG TAATGTTTCA 

ATTAAGTTTT TTTAAAAAAA AGAGATGATT TCCAAATAAG AAAGCCGTGT TGGTAAGGCA 
AGAGGAGC 

(2) INFORMATION FOR SEQ ID NO: 205:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205: 

TACGCTGCAA CACTGTGGAG CCATTCATAC AGGTCCCTAA TTAAGGAACA AGTGATTATG 60

CTACCTTTGC ACGGTTAGGG TACCGCGGCC GTTAAACATG TGTCACTGGG CAGGCGGTGC 120

CTCTAATACT GGTGATGCTA GAGGTGATGT TTTTGGTAAA CAGGCGGGGT AAGATTTGCC 180

GAGTTCCTTT TACTTTTTTT AACCTTTCCT TATGAGCATG CCTGTGTTGG GTTGACAGTG 240

GGGGTAATAA TGACTTGTTG GTTGATTGTA GATATTGGGC TGTTAATTGT CAGTTCAGTG 300

TTTTAATCTG ACGCAGGCTT ATGCGGAGGA GAATGTTTTC ATGTTACTTA TACTAACATT 360

AGTTCTTCTA TAGGGTGATA GATTGGTCCA ATTGGGTGTG AGGAGTTCAG TTATATGTTT 420

GGGATTTTTT AGGTAGTGGG TGTTGANCTT GAACGCTTTC TTAATTGGTG GCTGCTTTTA 480

RGCCTACTAT GGGTGGTAAA TGGCT 505

(2) INFORMATION FOR SEQ ID NO: 206: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206:

TAGACTGACT CATGTCCCCT ACCAAAGCCC ATGTAAGGAG CTGAGTTCTT AAAGACTGAA 
GACAGACTAT TCTCTGGAGA AAAATAAAAT GGAAATTGTA CTTTAAAAAA AAAAAAAATC 
GGCCGGGCAT GGTAGCACAC ACCTGTAATC CCAGCTACTA GGGGACATGA GTCAGTCTA 

(2) INFORMATION FOR SEQ ID NO: 207:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 176 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

AGACTGACTC ATGTCCCCTA CCCCACCTTC TGCTGTGCTG CCGTGTTCCT AACAGGTCAC 60

AGACTGGTAC TGGTCAGTGG CCTGGGGGTT GGGGACCTCT ATTATATGGG ATACAAATTT 120

AGGAGTTGGA ATTGACACGA TTTAGTGACT GATGGGATAT GGGTGGTAAA TGGCTA 176

(2) INFORMATION FOR SEQ ID NO: 208:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: 

AGACTGACTC ATGTCCCCTA TTTAACAGGG TCTCTAGTGC TGTGAAAAAA AAAAATGCTG 60

AACATTGCAT ATAACTTATA TTGTAAGAAA TACTGTACAA TGACTTTATT GCATCTGGGT 120

AGCTGTAAGG CATGAAGGAT GCCAAGAAGT TTAAGGAATA TGGGTGGTAA ATGGCTAGGG 180

GACATGAGTC AGTCTA 196

(2) INFORMATION FOR SEQ ID NO: 209:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 



GACGCTTGGC CACTTGACAC CTTTTATTTT TTAAGGATTC TTAAGTCATT TANGTNACTT 60

TGTAAGTTTT TCCTGTGCCC CCATAAGAAT GATAGCTTTA AAAATTATGC TGGGGTAGCA 120

AAGAAGATAC TTCTAGCTTT AGAATGTGTA GGTATAGCCA GGATTCTTGT GAGGAGGGGT 180

GATTTAGAGC AAATTTCTTA TTCTCCTTGC CTCATCTGTA ACATGGGGAT AATAATAGAA 240

CTGGCTTGAC AAGGTTGGAA TTAGTATTAC ATGGTAAATA CATGTAAAAT GTTTAGAATG 300

GTGCCAAGTA TCTAGGAAGT ACTTGGGCAT GGGTGGTAAA TGGCT 345



(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210:

GACGCTTGGC CACTTGACAC TAGAGTAGGG TTTGGCCAAC TTTTTCTATA AAGGACCAGA 60

GAGTAAATAT TTCAGGCTTT GTGGGTTGTG CAGTCTCTCT TGCAACTACT CAGCTCTGCC 120

ATTGTAGCAT AGAAATCAGC CATAGACAGG ACAGAAATGA ATGGGTGGTA AATGGCTA 178



(2) INFORMATION FOR SEQ ID NO: 211: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211:

TGGGCACCTT CAATATCTAT CCAGCGCATC TAAATTCGCT TTTTTCTTGA TTAAAAATTT 60

CACCACTTGC TGTTTTTGCT CATGTATACC AAGTAGCAGT GGTGTGAGGC CATGCTTGTT 120

TTTTGATTCG ATATCAGCAC CGTATAAGAG CAGTGCTTTG GCCATTAATT TATCTTCATT 180

GTAGACAGCA TAGTGTAGAG TGGTATCTCC ATACTCATCT GGAATATTTG GATCAGTGCC 240

ATGTTCCAGC AACATTAACG CACATTCATC TTCCTGGCAT TGTACGGCCT TTGTCAGAGC 300

TGTCCTCTTT TTGTTGTCAA GGACATTAAG TTGACATCGT CTGTCCAGCA CGAGTTTTAC 360

TACTTCTGAA TTCCCATTGG CAGAGGCCAG ATGTAGAGCA GTCCTCTTTT GCTTGTCCCT 420

CTTGTTCACA TCAGTGTCCC TGAGCATAAC GGAA 454

(2) INFORMATION FOR SEQ ID NO: 212:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 



TCCGTTATGC CACCCAGAAA ACCTACTGGA GTTACTTATT AACATCAAGG CTGGAACCTA 60

TTTGCCTCAG TCCTATCTGA TTCATGAGCA CATGGTTATT ACTGATCGCA TTGAAAACAT 120

TGATCACCTG GGTTTCTTTA TTTATCGACT GTGTCATGAC AAGGAAACTT ACAAACTGCA 180

ACGCAGAGAA ACTATTAAAG GTATTCAGAA ACGTGAAGCC AGCAATTGTT TCGCAATTCG 240

GCATTTTGAA AACAAATTTG CCGTGGAAAC TTTAATTTGT TCTTGT^CAG TCAAGAAAAA 300

CATTATTGAG GAAAATTAAT ATCACAGCAT AACGGAA 337



(2) INFORMATION FOR SEQ ID NO: 213:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213:

TCGGGTGATG CCTCCTCAGG CATCTTCCAT CCATCTCTTC AAGATTAGCT GTCCCAAATG 60

TTTTTCCTTC TCTTCTTTAC TGATAAATTT GGACTCCTTC TTGACACTGA TGACAGCTTT 120

AGTATCCTTC TTGTCACCTT GCAGACTTTA AACATAAAAA TACTCATTGG TTTTAAAAGG 180

AAAAAAGTAT ACATTAGCAC TATTAAGCTT GGCCTTGAAA CATTTTCTAT CTTTTATTAA 240

ATGTCGGTTA GCTGAACAGA ATTCATTTTA CAATGCAGAG TGAGAAAAGA AGGGAGCTAT 300

ATGCATTTGA GAATGCAAGC ATTGTCAAAT AAACATTTTA AATGCTTTCT TAAAGTGAGC 360

ACATACAGAA ATACATTAAG ATATTAGAAA GTGTTTTTGC TTGTGTACTA CTAATTAGGG 420

AAGCACCTTG TATAGTTCCT CTTCTAAAAT TGAAGTAGAT TTTAAAAACC CATGTAATTT 480

AATTGAGCTC TCAGTTCAGA TTTTAGGAGA ATTTTAACAG GGATTTGGTT TTGTCTAAAT 540

TTTGTCAATT TNTTTAGTTA ATCTGTATAA TTTTATAAAT GTCAAACTGT ATTTAGTCCG 600

TTTTCATGCT GCTATGAAAG AAATACCCAN GACAGGGTTA TTTATAAANG GAAAGANGTT 660

AATTTGACTC CCAGTTCACA GGCCTGAGGA NGNATCNCCC GAAATCCTTA TTGCG 715



(2) INFORMATION FOR SEQ ID NO: 214:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214:



GGTAANGNGC ATACNTCGGT GCTCCGGCCG CCGGAGTCGG GGGATTCGGG TGATGCCTCC 60

TCAGGCCCAC TTGGGCCTGC TTTTCCCAAA TGGCAGCTCC TCTGGACATG CCATTCCTTC 120

TCCCACCTGC CTGATTCTTC ATATGTTGGG TGTCCCTGTT TTTCTGGTGC TATTTCCTGA 180

CTGCTGTTCA GCTGCCACTG TCCTGCAAAG CCTGCCTTTT TAAATGCCTC ACCATTCCTT 240

CATTTGTTTC TTAAATATGG GAAGTGAAAG TGCCACCTGA GGCCGGGCAC AGTGGCTCAC 300

GCCTGTAATC CCAGCACTTT GGGAGCCTGA GGAGGCATCA CCCGA 345



(2) INFORMATION FOR SEQ ID NO: 215:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:215: 



GGTGATGCCT CCTCAGGCGA AGCTCAGGGA GGACAGAAAC CTCCCGTGGA GCAGAAGGGC 60

AAAAGCTCGC TTGATCTTGA TTTTCAGTAC GAATACAGAC CGTGAAAGCG GGGCCTCACG 120

ATCCTTCTGA CCTTTTGGGT TTTAAGCAGG AGGTGTCAGA AAAGTTACCA CAGGGATAAC 180

TGGCTTGTGG CGGCCAAGCG TTCATAGCGA CGTCGCTTTT TGATCCTTCG ATGTCGGCTC 240

TTCCTATCAT TGTGAAGCAG AATTCACCAA GCGTTGGATT GTTCACCCAC TAATAGGGAA 300

CGTGAGCTGG GTTTAGACCG TCGTGAGACA GGTTAGTTTT ACCCTACTGA TGATGTGTKG 360

TTGCCATGGT AATCCTGCTC AGTACGAGAG GAACCGCAGG TTCASACATT TGGTGTATGT 420

GCTTGCCTT 429



(2) INFORMATION FOR SEQ ID NO: 216:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216:

TGACACCTAT GTCCNGCATC TGTTCACAGT TTCCACAAAT AGCCAGCCTT TGGCCACCTC 60

TCTGTCCTGA GGTATACAAG TATATCAGGA GGTGTATACC TTCTCTTCTC TTCCCCACCA 120

AAGAGAACAT GCAGGCTCTG GAAGCTGTCT TAGGAGCCTT TGGGCTCAGA ATTTCAGAGT 180

CTTGGGTACC TTGGATGTGG TCTGGAAGGA GAAACATTGG CTCTGGATAA GGAGTACAGC 240

CGGAGGAGGG TCACAGAGCC CTCAGCTCAA GCCCCTGTGC CTTAGTCTAA AAGCAGCTTT 300

GGATGAGGAA GCAGGTTAAG TAACATACGT AAGCGTACAC AGGTAGAAAG TGCTGGGAGT 360

CAGAATTGCA CAGTGTGTAG GAGTAGTACC TCAATCAATG AGGGCAAATC AACTGAAAGA 420

AGAAGACCNA TTAATGAATT GCTTANGGGG AAGGATCAAG GCTATCATGG AGATCTTTCT 480

AGGAAGATTA TTGTTTANAA TTATGAAAGG ANTAGGGCAG GGACAGGGCC AGAAGTANAA 540

GANAACATTG CCTATANCCC TTGTCTTGCA CCCAGATGCT GGACAAGGTG TCA 593

(2) INFORMATION FOR SEQ ID NO: 217:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 335 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:

TGACACCTTG TCCAGCATCT GACGTGAAGA TGAGCAGCTC AGAGGAGGTG TCCTGGATTT 60

CCTGGTTCTG TGGGCTCCGT GGCAATGAAT TGTTCTGTGA AGTGGATGAA GACTACATCC 120

AGGACAAATT TAATCTTACT GGACTCAATG AGCAGGTCCC TCACTATCGA CAAGCTCTAG 180

ACATGATCTT GGACCTGGAG CCTGATGAAG AACTGGTVAGA CAACCCCAAC CAGAGTGAGC 240

TGATTGAGCA GGCAGCCGAG ATGCTTTATG GATTGATCCA CGCCCGCTAC ATCCTTACCA 300

ACCGTGGCAT CGCCCAGATG CTGGACAAGG TGTCA 335

(2) INFORMATION FOR SEQ ID NO: 218:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 248 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218:

TACGTACTGG TCTTGAAGGT CTTAGGTAGA GAAAAAATGT GAATATTTAA TCAAAGACTA 60

TGTATGAAAT GGGACTGTAA GTACAGAGGG AAGGGTGGCC CTTATCGCCA GAAGTTGGTA 120

GATGCGTCCC CGTCATGAAA TGTTGTGTCA CTGCCCGACA TTTGCCGAAT TACTGAAATT 180

CCGTAGAATT AGTGCAAATT CTAACGTTGT TCATCTAAGA TTATGGTTCC ATGTTTCTAG 240

TACTTTTA 248

(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 530 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

TGACGCTTGG CCACTTGACA CAAGTAGGGG ATAAGGACAA AGACCCATNA GGTGGCCTGT 60

CAGCCTTTTG TTACTGTTGC TTCCCTGTCA CCACGGCCCC CTCTGTAGGG GTGTGCTGTG 120

CTCTGTGGAC ATTGGTGCAT TTTCACACAT ACCATTCTCT TTCTGCTTCA CAGCAGTCCT 180

GAGGCGGGAG CACACAGGAC TACCTTGTCA GATGANGATA ATGATGTCTG GCCAACTCAC 240

CCCCCAACCT TCTCACTAGT TATANGAAGA GCCANGCCTA NAACCTTCTA TCCTGNCCCC 300

TTGCCCTATG ACCTCATCCC TGTTCCATGC CCTATTCTGA TTTCTGGTGA ACTTTGGAGC 360

AGCCTGGTTT NTCCTCCTCA CTCCAGCCTC TCTCCATACC ATGGTANGGG GGTGCTGTTC 420

CACNCAAANG GTCAGGTGTG TCTGGGGAAT CCTNANANCT GCCNGGAGTT TCCNANGCAT 480

TCTTAAAAAC CTTCTTGCCT AATCANATNG TGTCCAGTGG CCAACCNTCN 530

(2) INFORMATION FOR SEQ ID NO: 220:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:



TGACGCTTGG CCACTTGACA CTAAATAGCA TCTTCTAAAG GCCTGATTCA GAGTTGTGGA 60

AAATTCTCCC AGTGTCAGGG ATTGTCAGGA ACAGGGCTGC TCCTGTGCTC ACTTTACCTG 120

CTGTGTTTCT GCTGGAAAAG GAGGGAAGAG GAATGGCTGA TTTTTACCTA ATGTCTCCCA 180

GTTTTTCATA TTCTTCTTGG ATCCTCTTCT CTGACAACTG TTCCCTTTTG GTCTTCTTCT 240

TCTTGCTCAG AGAGCAGGTC TCTTTAAAAC TGAGAAGGGA GAATGAGCAA ATGATTAAAG 300

AAAACACACT TCTGAGGCCC AGAGATCAAA TATTAGGTAA ATACTAAACC GCTTGCCTGC 360

TGTGGTCACT TTTCTCCTCT TTCACATGCT CTATCCCTCT ATCCCCCACC TATTCATATG 420

GCTTTTATCT GCCAAGTTAT CCGGCCTCTC ATCAACCTTC TCCCCTAGCC TACTGGGGGA 480

TATCCATCTG GGTCTGTCTC TGGTGTATTG GTGTCAAGTG GCCAAGCGTC A 531



(2) INFORMATION FOR SEQ ID NO: 221: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

ATTGACGCTT GGCCACTTGA CACCCGCCTG CCTGCAATAC TGGGGCAAGG GCCTTCACTG 60

CTTTCCTGCC ACCAGCTGCC ACTGCACACA GAGATCAGAA ATGCTACCAA CCAAGACTGT 120

TGGTCCTCAG CCTCTCTGAG GAGAAAGAGC AGAAGCCTGG AAGTCAGAAG AGAAGCTAGA 180

TCGGCTACGG CCTTGGCAGC CAGCTTCCCC ACCTGTGGCA ATAAAGTCGT GCATGGCTTA 240

ACAATGGGGG CACCTCCTGA GAAACACATT GTTAGGCAAT TCGGCGTGTG TTCATCAGAG 300

CATATTTACA CAAACCTCGA TAGTGCAGCC TACTATCCAC TATTGCTCCT ACGCTGCAAA 360

CCTGAACAGC ATGGGACTGT ACTGAATACT GGAAGCAGCT GGTGaSSI CT^aItT^ 420

TATCTAAACA CAGAGAAGGT ACAGTAAGAA TATGGTATCA TAAACTTACA SgACCGCCA 480

TCCTATATGC AGTCTGTTGT GACCAAAATG TGTCAAGTGG CCAAGCGTCA 530

(2) INFORMATION FOR SEQ ID NO: 222:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222:

TGTATCGACG TAGTGGTCTC CGGGCTACTA GGCCGTTGTG TGCTGGTAGT ACCTGGTTCA 
CTGAAAGGCG CATCTCCCTC CCCGCGTCGC CCTGAAGCAG GGGGAGGACT JcGCCCAGCC 
AAGGCAGTTG TATGAGTTTT AGCTGCGGCA CTTCGAGACC TCTGAgSS ^SccScaS 
GAGCCTTCCC CGATTAAGGA AGCCAGGGTA AGGATTCCTT CCTCCCCCaS aScSSJ^ 
CAAACCACCA CCCCCCCTAT TCTGGCAGCC CATATACATC AGaJcgSJc JSSJS^ 
AATAAACNAA AACCAAAAAA AAAAGAGAAG GGGAAATGTA TATGTCTGTC C^^SJS 
rJIn^^^'''''' TCAGCTCCTA NAGGGCAGGG ACCGTGTCTT CCGAATGgJc SJgSgSc 
CGACTGCGGG AAGTATCGGA GGAGGAAGCA GAGTCAGCAG AAGTTGAACG GTGgScCgS 

cggctcttgg gggctggtgt tgtacttcga gaccgctttc gc??SJJtc tSgaSSc 
gtttgctctt tggagtggga naccactacn tcnataca '^^^^^ ttagatttac 

(2) INFORMATION FOR SEQ ID NO: 223:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

IS^;^^''^^ TAGTGGTCTC CTCTTGCAAA GGACTGGCTG GTGAATGGTT TCCCTGAATT 
ATGGACTTAC CCT^^^CATA TCTTATCATC ATTACCAGTT GCAAAATATT AgSS^S^J 
GTCACTGTTT CATTTGATTC CTAGAAGGTT AGTCTTAGAT ATGtJ^SS J^SStcStG 

ctgtagtgct ttgaatgcat tttttgtttg catttttgtt tgcccaacct gtcS?SS 

GCTGCTTAGG TCTGGACTGT CCTGGATAAA GCTGTTAAAA TaScJScS ^CciS^AS 
S^^JJ^f ^ ATTAAGTCAA CTAAATGCTT CCTTGTTTTG CCAgSSg? SJS^cSS 

ctcaatttct gggttcattt tgggtgccct aaatcttagg gtgtgacttt cttagcatcc 
tgtaacatcc attcccaagc aagcacaact tcacataata ctttccSII SSaSgS 
.gaagcctttc cttcacccag cggagcaact tgattttcta caIc?ScJ? cScagIgcc 

ACAAGAGTAT GGGATATGGA GACCACTACG TCGATACA ^^^^^^^^ CATCAGAGCC 
(2) INFORMATION FOR SEQ ID NO:224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:

TGTATCGACG TANTGGTCTC CCAAGGTGCT GGGATTGCAG GCATGAGCCA CCACTCCCAG 60

GTGGATCTTT TTCTTTATAC TTACTTCATT AGGTTTCTGT TATTCAAGAA GTGTAGTGGT 120

AAAAGTCTTT TCAATCTACA TGGTTAAATA ATGATAGCCT GGGAAATAAA TAGAAATTTT 180

TTCTTTCATC TTTAGGTTGA ATAAAGAAAC AGAAAAAATA GAACATACTG AAAATAATCT 240

AAGTTCCAAC CATAG7VAGAA CTGCAGAAGA AATGAAGAAA GTGATGATGA TTTAGATTTT 300

GATATTGATT TAGAAGACAC AGGAGGAGAC CACTACGTCG ATACA 345

(2) INFORMATION FOR SEQ ID NO: 225:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

TGTATCGACG TAGTGGTCTC CAAACTGAGG TATGTGTGCC ACTAGCACAC AAAGCCTTCC 60

AACAGGGACG CAGGCACAGG CAGTTTAAAG GGAATCTGTT TCTAAATTAA TTTCCACCTT 120

CTCTAAGTAT TCTTTCCTAA AACTGATCAA GGTGTGAAGC CTGTGCTCTT TCCCAACTCC 180

CCTTTGACAA CAGCCTTCAA CTAACACAAG AAAAGGCATG TCTGACACTC TTCCTGAGTC 240

TGACTCTGAT ACGTTGTTCT GATGTCTAAA GAGCTCCAGA ACACCAAAGG GACAATTCAG 300

AATGCTGGTG TATAACAGAC TCCAATGGAG ACCACTACGT CGATACA 347

(2) INFORMATION FOR SEQ ID NO: 226:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 



AGGNGNGGGA NTGTATCGAC GTAGTGGTCT CCCAACAGTC TGTCATTCAG TCTGCAGGTG 60

TCAGTGTTTT GGACAATGAG GCACCATTGT CACTTATTGA CTCCTCAGCT CTAAATGCTG 120

AAATTAAATC TTGTCATGAC AAGTCTGGAA TTCCTGATGA GGTTTTACAA AGTATTTTGG 180

ATCAATACTC CAACAAATCA GAAAGCCAGA AAGAGGATCC TTTCAATATT GCAGAACCAC 240

GAGTGGATTT ACACACCTCA GGAGACCACT ACGTCGATAC A 281



(2) INFORMATION FOR SEQ ID NO: 227: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 



GGGAAACACT TCCTCCCAGC CTTGTAAGGG TTGGAGCCCT CTCCAGTATA TGCTGCAGAA 60

TTTTTCTCTC GGTTTCTCAG AGGATTATGG AGTCCGCCTT AAAAAAGGCA AGCTCTGGAC 120

ACTCTGCAAA GTAGAATGGC CAAAGTTTGG AGTTGAGTGG CCCCTTGAAG GGTCACTGAA 180






CCTCACAATT GTTCAAGCTG TGTGGCGGGT TGTTACTGAA ACTCCCGGCC TCCCTGATCA 240
GTTTCCCTAC ATTGATCAAT GGCTGAGTTT GGTCAGGAGC ACCCCTTCCG TGGCTCCACT 300
CATGCACCAT TCATAATTTT ACCTCCAAGG TCCTCCTGAG CCAGACCGTG TTTTCGCCTC 360
GACCCTCAGC CGGTTCGGCT CGCCCTGTAC TGCCTCTCTC TGAAGAAGAG GAGAGTCTCC 420
CTCACCCAGT CCCACCGCCT TAAAACCAGC CTACTCCCTT AGGGTCATCC CATGTCTCCT 480
CGGCTATGTC CCCTGTAGGC TCATCACCCA TTGCCTCTTG GTTGCAACCG TGGTGGGAGG 540
AAGTAGCCCC TCTACTACCA CTGAGAGAGG CACAAGTCCC TCTGGGTGAT GAGTGCTCCA 600
CCCCCTTCCT GGTTTATGTC CCTTCTTTCT ACTTCTGACT TGTATAATTG GAAAACCCAT 660
AATCCTCCCT TCTCTGAAAA GCCCCAGGCT TTGACCTCAC TGATGGAGTC TGTACTCTGG 720
ACACATTGGC CCACCTGGGA TGACTGTCAA CAGCTCCTTT TGACCCTTTT CACCTCTGAA 780
GAGAGGGAAA GTATCCAAAG AGAGGCCAAA AAGTACAACC TCACATCAAC CAATAGGCCG 840
GAGGAGGAAG CTAGAGGAAT AGTGATTAGA GACCCAATTG GGACCTAATT GGGACCCAAA 900
TTTCTCAAGT GGAGGGAGAA CTTTTGACGA TTTCCACCGG TATCTCCTCG TGGGTATTCA 960
GGGAGCTGCT CAGAAACCTA TAAACTTGTC TAAGGCGACT GAAGTCGTCC AGGGGCATGA 1020
TGAGTCACCA GGAGTGTTTT TAGAGCACCT CCAGGAGGCT TATCAGATTT ACACCCCTTT 1080
TGACCTGGCA GCCCCCGAAA ATAGCCATGC TCTTAATTTG GCATTTGTGG CTCAGGCAGC 1140
CCCAGATAGT AAAAGGAAAC TCCAAAAACT AGAGGGATTT TGCTGGAATG AATACCAGTC 1200
AGCTTTTAGA GATAGCCTAA AAGGTTTTTG ACAGTCAAGA GGTTGAAAAA CAAAAACAAG 1260

CAGCTCAGGC AGCTGAAJUVA AGCCACTGAT AAAGCATCCT GGAGTATCAG AGTTTACTGT 1320
TAGATCAGCC TCATTTGACT TCCCCTCCCA CATGGTGTTT AAATCCAGCT ACACTACTTC 1380
CTGACTCAAA CTCCACTATT CCTGTTCATG ACTGTCAGGA ACTGTTGGAA ACTACTGAAA 1440
CTGGCCGACC TGATCTTCAA AATGTGCCCC TAGGAAAGGT GGATGCCACC ATGTTCACAG 1500
ACAGTAGCAG CTTCCTCGAG AAGGGACTAC GAAAGGCCGG TGCAGCTGTT ACCATGGAGA 1560
CAGATGTGTT GTGGGCTCAG GCTTTACCAG CAAACACCTC AGCACAAAAG GCTGAATTGA 1620
TCGCCCTCAC TCAGGCTCTC CGATGGGGTA AGGATATTAA CGTTAACACT GACAGCAGGT 1680
ACGCCTTTGC TACTGTGCAT GTACGTGGAG CCATCTACCA GGAGCGTGGG CTACTCACCT 1740
CAGCAGGTGG CTGTAATCCA CTGTAAAGGA CATCAAAAGG AAAACACGGC TGTTGCCCGT 1800
GGTAACCAGA A^.GCTGATTC AGCAGCTCAA GATGCAGTGT GACTTTCAGT CACGCCTCTA 1860
AACTTGCTGC CCACAGTCTC CTTTCCACAG CCAGATCTGC CTGACAATCC CGCATACTCA 1920
ACAGAAGAAG A^J^ACTGGCC TCAGAACTCA GAGCCAATAA AAATCAGGAA GGTTGGTGGA 1980
TTCTTCCTGA CTCTAGAATC TTCATACCCC GAACTCTTGG GAAAACTTTA ATCAGTCACC 2040
TACAGTCTAC CACCCATTTA GGAGGAGCAA AGCTACCTCA GCTCCTCCGG AGCCGTTTTA 2100
AGATCCCCCA TCTTCAAJ^GC CTAACAGATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC 2160
AGGTAAATGC CAAAAAAGGT CCTAAACCCA GCCCAGGCCA CCGTCTCCAA GAAAACTCAC 2220
CAGGAGAAAA GTGGGAAATT GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAT 2280
ACCTTCTAGT ACTGGTAGAC ACCTTCTCTG GATGGACTGA AGCATTTGCT ACCAAAAACG 2340

AAACTGTCAA TATGGTAGTT AAGTTTTTAC TCAATGAAAT CATCCCTCGA CATGGGCTGC 2400
CTGTTTGCCA TAGGGTCTGA TAATGGACCG GCCTTCGCCT TGTCTATAGT TTAGTCAGTC 2460
AGTAAGGCGT TAJ^Ji^CATTCA ATGGAAGCTC CATTGTGCCT ATCGACCCCA GAGCTCTGGG 2520
CAAGTAGAAC GCATGAACTG CACCCTAAAA AACACTCTTA CAAAATTAAT CTTAGAAACC 2580
GGTGTAAATT GTGTAAGTCT CCTTCCTTTA GCCCTACTTA GAGTAAGGTG CACCCCTTAC 2640
TGGGCTGGGT TCTTACCTTT TGAAATCATG TATGGGAGGG TGCTGCCTAT CTTGCCTAAG 2700
CTAAGAGATG CCCAATTGGC AAAAATATCA CAAACTAATT TATTACAGTA CCTACAGTCT 2760
CCCCAACAGG TACAAGATAT CATCCTGCCA CTTGTTCGAG GAACCCATCC CAATCCAATT 2820
CCTGAACAGA CAGGGCCCTG CCATTCATTC CCGCCAGGTG ACCTGTTGTT TGTTAAAAAG 2880
TTCCAGAGAG AAGGACTCCC TCCTGCTTGG AAGAGACCTC ACACCGTCAT CACGATGCCA 2940
ACGGCTCTGA AGGTGGATGG CATTCCTGCG TGGATTCATC ACTCCCGCAT CAAAAAGGCC 3000
AACAGAGCCC AACTAGAAAC ATGGGTCCCC AGGGCTGGGT CAGGCCCCTT AAAACTGCAC 3060
CTAAGTTGGG TG.-JVGCCATT AGATTAATTC TTTTTCTTAA TTTTGTAAAA CAATGCATAG 3120
CTTCTGTCAA ACTTATGTAT CTTAAGACTC AATATAACCC CCTTGTTATA ACTGAGGAAT 3180
CAATGATTTG ATTCCCCCAA AAACACAAGT GGGGAATGTA GTGTCCAACC TGGTTTTTAC 3240
TAACCCTGTT TTTAGACTCT CCCTTTCCTT TAATCACTCA GCTTGTTTCC ACCTGAATTG 3300
ACTCTCCCTT AGCTAAGAGC GCGAGATGGA CTCCATCTTG GCTCTTTCAC TGGCAGCCGC 3360
TTCCTCAAGG ACTTAACTTG TGCAAGCTGA CTCCCAGCAC ATCCAAGAAT GCAATTAACT 3420
GATAAGATAC TGTGGCAAGC TATATCCGCA GTTCCCAGGA ATTCGTCCAA TTGATCACAG 3480
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CCCCTCTACC CTTCAGCAAC CACCACCCTG ATCAGTCAGC AGCCATCAGC ACCGAGGCAA 
GGCCCTCCAC CAGCAAAAAG ATTCTGACTC ACTGAAGACT TGGATGATCA TTAGTATTTT 
TAGCAGTAAA GTTTTTTTTT CTTTTTCTTT CTTTTTTTCT CGTGCC 

(2) INFORMATION FOR SEQ ID NO:228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

TAAGAGGGTA CAAGATCTAA GCACAGCCGT CAATGCAGAA CACAGAACGT AGCCTGGTAA 

GTGTGTTAAG AGTGGGAATT TTTGGAGTAC AGAGTAAGGC ACCTAACCCT AGCTGGGGTT 

TGGTGACGGT CCCAGATGGC TTACAGAAGA AAGTGTCCTG AGATGAGTTT TTAAGAATGA 

ATAAGGATAG ACACAAGTGA GGACTGACTT GGCAGTGGTG AATGGTGGGT GGCAAAAAAC 

TTCGCATGTA TGGAAACTGC ACGTACAGGA ATGAAGAATG AGACTGTGTG GTGTTTAATG 

AGCTGCAAAT ACTAATTTTA TCCTGAAAGT TTTGAAGAGT TAACTAAAAA GTATTTTTTA 

GTAAGGAAAT AACCCTACAT TTCAGGGTTA TTGTTTGTTT AMATATTGAA GGTGCCCAA 

(2) INFORMATION FOR SEQ ID NO: 229:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 148 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:

AAGAGGGTAC CTGTATGTAG CCATGGTGGC AATGAGAGAC TGATTACTAC CTGCTGGAGA 60
TTGTTTAAGT GAGTTAATAT ATTAAGGATA AAGGGAGCCA GGTTTTTTGA CTGTTGGAGA 120
AGGAAATTAC AGATATTGAA GGTCCCAA 148

(2) INFORMATION FOR SEQ ID NO: 230:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 257 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 



TAAGAGGGTA CMAAAAAAAA AAAATAGAAC GAATGAGTAA GACCTACTAT TTGATAGTAC 

AACAGGGTGA CTATAGTCAA TGATAACTTA ATTATACATT TAACATAGAG TGTAATTGGA 

TTGTTTGTAA CTCGAAGGAT AAATGCTTGA GAGGATGGAT ACCCCATTCT CCATGATGTA 

CTTATTTCAC ATTACATGCC TGTATCAAAG CATCTCATAT ACCCTATAAA TATGTACACC 
TACTATGTAC CCTCTTA 

(2) INFORMATION FOR SEQ ID NO: 231: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 


TAAGAGGGTA CGGGTATTTG CTGATGGGAT TTTTTTTTCT TTCTTTTTCT TTGGAAAACA 
AAATGAAAGC CAGAACAAAA TTATTGAACA AAAGACAGGG ACTAAATCTG GAGAAATGAA 
GTCCCCTCAC CTGACTGCCA TTTCATTCTA TCTGACCTTC CAGTCTAGGT TAGGAGAATA 
GGGGGTGGAG GGGATTAATC TGATACAGGT ATATTTAAAG CAACTCTGCA TGTGTGCCAG 
AAGTCCATGG TACCCTCTTA 

(2) INFORMATION FOR SEQ ID NO:232: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232: 



TGCTCCTCTT GCCTTACCAA CCACAAATTA GAACCATAAT GAGATGTCAC CTCATACCTG 

GTGGGATTAA CATTATTTAA AAAATCAGAA GTATTGACAA GGATGTGAAG AAATTAGAAC 

ATCTGTGCAC TGTTGGTGGG AATGTAAAAA AGGTGTGGCC ACTATGGGTA ACAGCATGAA 

GGTTCCTCAA AAAAAATTTT TTTTAATCTA CTCTATGATC GATCTTGAGG TTGTTTATGC 

AAAAGAACTG AAATCAGGAT TTTGAGGAAA TATTCACATT CCCACATCCA TTTCTGCTTT 

ATTCATAATA CTCAAGAGAT GGAAACAACC TAAATGTCCA TCCCGGGATG AATGGATAAA 

CACAGTGTGG TATATGCATA CAATGGAATA TTATTTAGTC TTTAAAAAGA AAAATTCTAT 
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CATATACTAC AACTTANATN AACCTTGAGG ACACAATGCT NAGTGAAATA AGCCACGGAA 
GGACGAATAC TGCATTATTC CCTTATATGA AGTATCTAAA GTGGTCAAAC TCTTANAGCA 
NAAAGTAAAA ATGGGTGGTT GCCANACAGT TGGTTAGGCN AGAAGANAAN CCTANT 

(2) INFORMATION FOR SEQ ID NO: 233:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 96 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 

TCTTCTGAAG ACCTTTCGCG ACTCTTAAGC TCGTGGTTGG TAAGGCAAGA GGAGCGTTGG 
TAAGGCAAGA GGAGCGTTGG TAAGGCAAGA GGAGCA 

(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

TGTAAGTCGA GCAGTGTGAT GATAAAACTT GAATGGATCA ATAGTTGCTT CTTATGGATG 
AGCAAAGAAA GTAGTTTCTT GTGATGGT^T CTGCTCCTGG CAAAAATGCT GTGAACGTTG 
TTGAAAAGAC AACAAAGAGT TTAGAGTAGT ACATAAATTT AGAATAGTAC ATAAACTTAG 
AATAGTACAT AAACTTAGTA CATAAATAAT GCACGAAGCA GGGGCAGGGC TTGAGAGAAT 
TGACTTCAAT TTGGAAAGAG TATCTACTGT AGGTTAGATG CTCTCAAACA GCATCACACT 
GCTCGACTTA CAA 

(2) INFORMATION FOR SEQ ID NO: 235:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:

AACGAGGACA GATCCTTAAA AAGAATGTTG AGTGAAAAAA GTAGAAAATA AGATAATCTC 
CAAAGTCCAG TAGCATTATT TAAACATTTT TAAAAAATAC ACTGATAAAA ATTTTGTACA 
TTTCCCAAAA ATACATATGG AAGCACAGCA GCATGAATGC CTATGGGRTT GAGGATAGGG 
GTTGGGAGTA GGGATGGGGA TAAAGGGGGA AAATAAAACC AGAGAGGAGT CTTACACATT 
TCATGAACCA AGGAGTATAA TTATTTCAAC TATTTGTACC WGAAGTCCAG AAAGAGTGGA 
GGCAGAAGGG GGAGAAGAGG GCGAAGAAAC GTTTTTGGGA GAGGGGTCCC ASAAGAGAGA 
TTTTCGCGAT GTGGCGCTAC ATACGTTTTT CCAGGATGCC TTAAGCTCTG CACCCTATTT 
TTCTCATCAC TAATATTAGA TTAAACCCTT TGAAGACAGC GTCTGTGGTT TCTCTACTTC 
AGCTTTCCCT CCGTGTCTTG CACACAGTAG CTGTTTTACA AGGGTTGAAC TGACTGAAGT 
GAGATTATTC 

(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236:

TAGACTGACT CATGTCCCCT ACCAGAGTAG CTAGAATTAA TAGCACAAGC CTCTACACCC 

AGGAACTCAC TATTGAATAC ATAAATGGAA TTTATTCAGC CTTAAAAAGT TTGGAAGGAA 

ATTCTGACAT ATGCTAAAAC ATGGATGAAC CTTGAAGACT TTATGATAAG TAAAAGAAGC 

CAGTCATAAA AGGJ^J^JU^ATA TTGCATGATT CCACTTATAT GAGGTACCTA GAGTAGTCAA 

TTTCATAGAA ACAC^-AAATA GAATGGTGTT TGCCAGGGCT TTTGAGGAAA AGGGAATGAC 
AAGTTAGGGG ACATGAGTCA GTCTA 

(2) INFORMATION FOR SEQ ID NO: 237:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 373 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237:

TAGACTGACT CATGTCCCCT ATCTACTCAA CATTTCCACT TGAAGTCTGA TAGGCATCTC 
AGACTTATCT TGTCCCAAAG CAAACTCTTT ATTTCTTTTC ATCCTAGTCT TTATTTCTTG 
TGCTGTCTTA CCCATCTCAA AAGAGTGCCA AAATCCACCA AGTTGCTGAA ACAGAAATCT
AAGAAATATC CTTGATTCTT CTTTTTCCCA TCTACTTCAC TTCTAATTCA TTAGTAAATA 240
ATCTGTTTCA GAAAACCAAA CACCTCATGT TCTCACTCAT AAGGGGGAGT TGAACAATGA 300
GAACACACAG ACACAGGGAG GGGAACATCA CACACCACGG CCCGTCAGGG AGTANGGGAC 360
ATGAGTCAGT CTA 373



(2) INFORMATION FOR SEQ ID NO: 238:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 492 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238:

TAGACTGACT CATGTCCCCT ATAATGCTCC CAGGCATCAG AAAGCATCTC AAACTGGAGC 60
TGACACCATG GCAGAGGTTT CAGGTAAGTC ACAAAAGGGG TCCTAAAGAA TTTGCCCTCA 120
ATATCAGAGT GATTAGAAGA AGTGGACAGA GCTACCCAAG TTAAACATAT GCGAGATAAA 180
AAAAATATGG CACTTGTGAA CACACACTAC AGGAGGAAAA TAAGGAACAT AATAGCATAT 240
TGTGCTATTA TGATGATGAA GAACCTCTCT ANAAGAAAAC ATAACCAAAG AAACAAAGAA 300
AATTCCTGCN AATGTTTAAT GCTATAGAAG AAATTAACAA AAACATATAT TCAATGAATT 360
CAGAAAAGTT AGCAGGTCAN AAGAAAACAA ATCAAAGACC AGAAT7UVTCC CATTTTAGAT 420
TGTCGAGTAA ACTANAACAG AAAGAATACC ACTGGAAATT GAATTCCTAC GTANGGGACA 480
TGANTCANTC TA



(2) INFORMATION FOR SEQ ID NO: 239:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 482 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:239: 

TGGAAAGTAT TTAATGATGG GCAACTTGCT GTTTACTTCC TACATATCCC ATCATCTTCT 60
GTATTTTTTT AAATAACTTT TTTTTGGATT TTTAAAGTAA CCTTATTCTG AGAGGTAACA 120
TGGATTACAT ACTTCTAAGC CATTAGGAGA CTCTATGTTA AACCAAAAGG AAATGTTACT 180
AGATCTTCAT TTGATCAATA GGATGTGATA ATCATCATCT TTCTGCTCTA ATGGAAAAGT 240
ACTANAAACA TGGAACCATA ATCTTAGATG AACAACGTTA GAATTTGCAC TAATTCTACG 300
GAATTTCAGT AATTCGGCAA ATGTCGGGCA GTGACACAAC ATTTCATGAC GGGGACGCAT 360
CTACCAJICTT CTGGCGATAA GGGCCACCCT TCCCTCTGTA CTTACAGTCC CATTTCATAC 420
ACAGTCTTTG ATTAAATATT CACATTTTTT CTCTACCTAA AGACCTTCAA GACCAGTACG 480

482 
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(2) INFORMATION FOR SEQ ID NO:240: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240:

TGTATCGACG TAGTGGTCTC CCCATGTGAT AGTCTGAAAT ATAGCCTCAT GGGATGAGAG 
GCTGTGCCCC AGCCCGACAC CCGTAAAGGG TCTGTGCTGA GGTGGATTAG TAAAAGAGGA 
AAGCCTTGCA GTTGAGATAG AGGAAGGGCA CTGTCTCCTG CCTGCCCCTG GGAACTGAAT 
GTCTCGGTAT AAAACCCGAT TGTACATTTG TTCAATTCTG AGATAGGAGA AAAACCACCC 
TATGGCGGGA GGCGAGACAT GTTGGCAGCA ATGCTGCCTT GTTATGCTTT ACTCCACAGA 
TGTTTGGGCG GAGGGAAACA TAAATCTGGC CTACGTGCAC ATCCAGGCAT AGTACCTCCC 
TTTGAACTTA ATTATGACAC AGATTCCTTT GCTCACATGT TTTTTTGCTG ACCTTCTCCT 
TATTATCACC CTGCTCTCCT ACCGCATTCC TTGTGCTGAG ATAATGAAAA TAATATCAAT 
AAAAACTTGA NGGAACTCGG AGACCACTAC GTCGATACA 

(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

TGTATCGACG TAGTGGTCTC CACTCCCGCC TTGACGGGGC TGCTATCTGC CTTCCAGGCC 
ACTGTCACGG CTCCCGGGTA GAAGTCACTT ATGAGACACA CCAGTGTGGC CTTGTTGGCT 
TGAAGCTCCT CAGAGGAGGG TGGGAACAGA GTGACCGAGG GGGCAGCCTT GGGCTGACCT 
AGGACGGTCA GCTTGGTCCC TCCGCCAAAC ACGAGAGTGC TGCTGCTTGT ATATGAGCTG 
CAGTAATAAT CAGCCTCGTC CTCAGCCTGG AGCCCAGAGA TGGTCAGGGA GGCCGTGTTG 
CCANACTTGG AGCCAGAGAA GCGATTAGAA ACCCCTGAGG GCCGATTACC GACCTCATAA 
ATCATGAATT TGGGGGCTTT GCCTGGGTGC TGTTGGTACC ANGAGACATT ATTATAACCA 
CCAACGTCAC TGCTGGTTCC ANTGCAGGGA AAATGGTTGA TCNAACTGTC CAAGAAAACC 
ACTACGTCCA TACCAATCCA CTAATTGCCN GCCGCCTGCA GGTTCAACCA TATTGGGGAA 
NAACTCCCCN CCGCCGTTTG GGATTGNCAT NAACCTTTGA AATTTTTTCC TATTANTTGT 
CCCCCTAAAA TAAACCNTTG GGCNTTAATC CATTGGGTCC ATANCTTNTT TNCCCGGTTT
TTAAAANTTG TTTATCCCGC CNCCCNATTT CCCCCCCAAC TTTCCAAAAC CCGAAACCNT 
TNAAATTTNT TNAAACCCTG GGGGGTTCCC NNAATTNNAN TTNAANCTNC C 771 

(2) INFORMATION FOR SEQ ID NO: 242: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

TGGGCACCTT CAATATCGGG CTCATCGATA ACATCACGCT GCTGATGCTG CTGTTGCTGG 
TCCTCTCTAG GAACCTCTGG ATTTTCAAAT TCTTTGAGGA ATTCATCCAA ATTATCTGCC 
TCTCCTCCTT TCCTCCTTTT TCTAAGGTCT TCTGGTACAA GCGGTCA 

(2) INFORMATION FOR SEQ ID NO:243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:243: 

TTGGGCACCT TCAATATCTA CTGATCTAAA TAGTGTGGTT TGAGGCCTCT TGTTCCTGGC 

TAAAAATCCT TGGCAAGAGT CAATCTCCAC TTTACAATAG AGGTAAAAAT CTTACAATGG 

ATATTCTTGA CAAAGCTAGC ATAGAGACAG CAATTTTACA CAAGGTATTT TTCACCTGTT 

TAATAACAGT GGTTTTCCTA CACCCATAGG GTGCCACCAA GGGAGGAGTG CACAGTTGCA 

GAAACAAATT AAGATACTGA AGACAACACT ACTTACCATT TCCCGTATAG CTAACCACCA 
GTTCAACTGT ACATGTATGT TCTTATGGGC AATCAAGA 

(2) INFORMATION FOR SEQ ID NO:244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:244: 
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TTTTTGGCTC CCATACAGCA CACTCTCATG GGAAATGTCT GTTCTAAGGT CAACCCATAA 

TGCAAAAATC ATCAATATAC TTGAAGATCC CCGTGTAAGG TACAATGTAT TTAATATTAT 

CACTGATACA ATTGATCCAA TACCAGTTTT AGTCTGGCAT TGAATCAAAT CACTGTTTTT 

GTTGTATAAA AAGAGAAATA TTTAGCTTAT ATTTAAGTAC CATATTGTAA GAAAAAAGAT 

GCTTATCTTT ACATGCTAAA ATCATGATCT GTACATTGGT GCAGTGAATA TTACTGTAAA 
AGGGAAGAAG GAATGAAGAC GAGCTAAGGA TATTGAAGGT GCCCAA 

(2) INFORMATION FOR SEQ ID NO: 245:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245:

ACCAATCCCA CACGGATACT GAGGGACAAG TATATCATCC CATTTCATCC CTACAGCAGC 
AACTTCATGA GGCAGGAGTT ATTAGTCCCA TTTTACAGAA GAGGAAACTG AGACTTAGGG 
AGATCAAGTA ATTTGCCCAG GTCGCACAAT TAGTGATAGA GCCAGGGCTT GAAGCGACGT 
CTGTCTTAAG CCAATGACCC CTGCAGATTA TTAGAGCAAC TGTTCTCCAC AACAGTGTAA 240
GCCTCTTGCT ANAAGCTCAG GTCCACAAGG GCAGAGATTT TTGTCTGTTT TGCTCATTGC 300
TCCTTCCCCA TTGCTTAGAG CAGGGTCTGC CACGAANCAG GTTCTCAATG CATAGTTATT
AAATGTATAT AAGAGCAAAC ATATGTTACA GAGAACTTTC TGTATGCTTG TCACTTACAT
GAATCACCTG TGANATGGGT ATGCTTGTTC CCCANTGTTG CAGATNAAGA TATTGAANGT
GCCCAAATCA CTANTTGCGG GCGCCTGCAN GTCCANCATA T

(2) INFORMATION FOR SEQ ID NO: 246:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

TGGAACCAAT CCAAATACCC ATCAATGATA GACTGGATAA AGAAAATTTG GCACATGTTC 
ACCATGAAAT ACTATGCAGC CATAAAAAAG GATGAGTTCA TATCCTTTGC AGGGACATGG 
ATGAAGCTGG AGACCATCAT TCTCAGCAAA CTAACAAGGG AACAGAAAAC CAAACACTGC 
ATGTTCTCAC TCTTAAGTGG GAGCTGAACA ATGAGAACAC ATGGACACAG GGAGGGGAAC 
ATCACACAGT GGGGCCTGCT GGTGGGTAGG GGTCTAGGGG AGGGATAGCA TTAGGAGAAA 
TACCTAATGT AGATGACGGG TTGATGGGTG CAGCAAACCA CCATGACACG TGTATACCTA 
TGTAACAAAC CTGCATGTTC TGCACATGTA CCCCAGAACT TAAAGTGTTA ATAAAAAAAT 
TAAGAAAAAA GTTAAGTATG TCATAGATAC ATAAAATATT GTANATATTG AAGGTGCCCA 
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^ 482 
(2) INFORMATION FOR SEQ ID NO: 247:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 474 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

TTCGATACAG GCACAGAGTA AGCAGAAAAA TGGCTGTGGT TTAACCAAGT GAGTACAGTT 60
AAGTGAGAGA GGGGCAGAGA AGACAAGGGC ATATGCAGGG GGTGATTATA ACAGGTGGTT 120
GTGCTGGGAA GTGAGGGTAC TCGGGGATGA GGAACAGTGA AAAAGTGGCA AAAAGTGGTA 180
AGATCAGTGA ATTGTACTTC TCCAGAATTT GATTTCTGGN GGAGTCAAAT AACTATCCAG 240
TTTGGGGTAT CATANGGCAA CAGTTGAGGT ATAGGAGGTA GAAGTCNCAG TGGGATAATT 300
GAGGTTATGA ANGGTTTGGT ACTGACTGGT ACTGACAANG TCTGGGTTAT GACCATGGGA 360
ATGAATGACT GTANAAGCGT ANAGGATGAA ACTATTCCAC GANAAAGGGG TCCNAAAACT 420
AAAAANNNAA GNNNNNGGGG AATATTATTT ATGTGGATAT TGAANGTGCC CAAA 474

(2) INFORMATION FOR SEQ ID NO: 248:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:

TTCGATACAG GCT^CATGA ACTGCAGGAG GGTGGTGACG ATCATGATGT TGCCGATGGT 60
CCGGATGGNC ACGAAGACGC ACTGGANCAC GTGCTTACGT CCTTTTGCTC TGTTGATGGC 120
CCTGAGGGGA CGCAGGACCC TTATGACCCT CAGAATCTTC ACAACGGGAG ATGGCACTGG 180
ATTGANTCCC ANTGACACCA GAGACACCCC AACCACCAGN ATATCANTAT ATTGATGTAG 240
TTCCTGTAGA NGGCCCCCTT GTGGAGGAAA GCTCCATNAG TTGGTCATCT TCAACAGGAT 300
CTCAACAGTT TCCGATGGCT GTGATGGGCA TAGTCATANT TAACCNTGTN TCGAA 355

(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249:

TTGGATTGGT CCTCCAGGAG AACAAGGGGA AAAAGGTGAC CGAGGGCTCC CTGGAACTCA 60
AGGATCTCCA GGAGCAAAAG GGGATGGGGG AATTCCTGGT CCTGCTGGTC CCTTAGGTCC 120
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ACCTGGTCCT CCAGGCTTAC CAGGTCCTCA AGGCCCAAAG GGTAACAAAG GCTCTACTGG 

ACCCGCTGGC CAGAAAGGTG ACAGTGGTCT TCCAGGGCCT CCTGGGCCTC CAGGTCCACC 

TGGTGAAGTC ATTCAGCCTT TACCAATCTT GTCCTCCAAA AAAACGAGAA GACATACTGA 

AGGCATGCAA GCAGATGCAG ATGATAATAT TCTTGATTAC TCGGATGGAA TGGAAGAAAT 

JSgaccS? CTCAATTCCC TGAAACAAGA CATCGAGCAT ATGAAATTTC CAATGGGTAC

434 

(2) INFORMATION FOR SEQ ID NO: 250:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:

TGGATTGGTC ACATGGCAGA GACAGGATTC CAAGGCAGTG AGAGGAGGAT ACAATGCTTC 
TCACTAGTTA TTATTATTTA TTTTATTTTT GAGATGAAGT CTCGCTTTGT CTCCCAGGCT 
GGAGAGCGGT GGTGCGATCT TGGCTCTCTG CAACCCCCGC CTCAAGCAAT TCTCCTGTCT 
TAGCCTCGCG GGTAGATGGA ATTACAGGCG CCCACCGCCA TGCCCAACTA ATTTTTTTGT 
GTCTTCAGTA GAGACAGGGT TTCGCCATGT TGGGCAGGCT GGTCTTGAAC TCCTGACCTC 
NAGTGATCTG CCCTCCTCGG CCTCACAAAG TGCTGGAATT ACAGGCATGG GCTGCTGCAC 
CCAGTCAAC^ TCTCACTAGT TATGGCCTTA TCATTTTCAC CACATTCTAT TGGCCCAAAA 

(2) INFORMATION FOR SEQ ID NO: 251:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

TGGTACTCCA CCATYATGGG GTCAACCGCC ATCCTCGCCC TCCTCCTGGC TCTTC^ncAA 
GGAGTCTGTG CCGAGGTGCA GCTGRTGCAG TCTGGAGCAG AGGTGAAAAA GTCCGGGGAG 
n^l?!':'^'''' TCTCCTGTAA GGGTTCTGGA TACACCTTTA AGATCTACTG GATCGCCTGG 
GTGCGCCAGT TGCCCGGGAA AGGCCTGGAG TGGATGGGGC TCATCTTTCC TGATGACTCT 
GATACCAGAT ACAGCCCGTC CTTCCAAGGC CAGGTCACCA TCTCAGTCGA TAAGTCCATC 
AGCACCGCCT ATCTGCAGTG GAGTACCAA

(2) INFORMATION FOR SEQ ID NO:252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:252: 

TGGTACTCCA CTCAGCCCAA CCTTAATTAA GAATTAAGAG GGAACCTATT ACTATTCTCC 
CAGGCTCCTC TGCTCTAACC AGGCTTCTGG GACAGTATTA GAAAAGGATG TCTCAACAAG
TATGTAGATC CTGTACTGGC CTAAGAAGTT AAACTGAGAA TAGCATAAAT CAGACCAAAC 180
TTAATGGTCG TTGAGACTTG TGTCCTGGAG CAGCTGGGAT AGGAAAACTT TTGGGCAGCA 240
AGAGGAAGAA CTGCCTGGAA GGGGGCATCA TGTTAAAAAT TACAAGGGGA ACCCACACCA 300
GGCCCCCTTC CCAGCTCTCA GCCTAGAGTA TTAGCATTTC TCAGCTAGAG ACTCACAACT 360
TCCTTGCTTA GAATGTGCCA CCGGGGGGAG TCCCTGTGGG TGATGAGGCT CTCAAGAGTG 420
AGAGTGGCAT CCTATCTTCT GTGTGCCCAC AGGAGCCTGG CCCGAGACTT AGCAGGTGAA 480
GTTTCTGGTC CAGGCTTTGC CCTTGACTCA CTATGTGACC TCTGGTGGAG TACCAA 536

(2) INFORMATION FOR SEQ ID NO:253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253:

NTGTTGCGAT CCCAGTAACT CGGGAAGCTG AGGCGGGAGG ATCACCTGAG CTCAGGAGGT 60
TGAGGCCGCA GTGAGCCGGG ACCACGCCAC TACACTCCAG CCTGGGGCAT AGAGTGAGAC 120
CCTCCAAGAC AGAAAAGAAA AGAAAGGAAG GGAAAGGGAA AGGGAAAAGG AAAAGGAAAA 180
GGAAAAGGAA AAGGAAAAGA CAAGACAAAA CAAGACTTGA ATTTGGATCT CCTGACTTCA 240
ATTTTATGTT CTTTCTACAC CACAATTCCT CTGCTTACTA AGATGATAAT TTAGAAACCC 300
CTCGTTCCAT TCTTTACAGC AAGCTGGAAG TTTGGTCAAG TAATTACAAT AATAGTAACA 360
AATTTGAATA TTATATGCCA GGTGTTTTTC ATTCCTGCTC TCACTTAATT CTCACCACTC 420
TGATATAAAT ACAATTGCTG CCGGGTGTGG TGGCTCATGC CTGTAATCCC GGCACTTTGG 480
GAGACCGAGG TGGGCGGATS GCAACAA 507

(2) INFORMATION FOR SEQ ID NO:254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

TTGGATTGGT CACTGTGAGG AAGCCAAATC GGATCCGAGA GTCTTTTTCT AAAGGCCAGT 60
ACTGGCCACA CTTTCTCCTG CCGCCTTCCT CAAAGCTGAA GACACACAGA GCAAGGCGCT 120
TCTGTTTTAC TCCCCAATGG TAACTCCAAA CCATAGATGG TTAGCTNCCC TGCTCATCTT 180
TCCACATCCC TGCTATTCAG TATAGTCCGT GGACCAATCC AA 222

(2) INFORMATION FOR SEQ ID NO: 255:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255:

TGTTGCGATC CATAAATGCT GAAATGGAAA TAAACAACAT GATGAGGGAG GATTAAGTTG 60
GGGAGGGAGC ACATTAAGGT GGCCATGAAG TTTGTTGGAA GAAGTGACTT TTGAACAAGG 120
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CCTTGGTGTT AAGAGCTGAT GAGAGTGTCC CAGACAGAGG GGCCACTGGT ACAATAGACG 
AGATGGGAGA GGGCTTGGAA GGTGTGCGAA ATAGGAAGGA GTTTGTTCTG GTATGAGTCT 
AGTGAACACA GAGGCGAGAG GCCCTGGTGG GTGCAGCTGG AGAGTTATGC AGAATAACAT 
TAGGCCCTGT GGGGGACTGT AGACTGTCAG CAATAATCCA CAGTTTGGAT TTTATTCTAA 
GAGTGATGGG AAGCCGTGGA AAGGGGGTTA AGCAAGGAGT GAAATTATCA GATTTACAGT 
GATAAAAATA AATTGGTCTG GCTACTGGGG AAAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO: 256:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 262 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256:

TTGGATTGGT CAACCTGCTG AACTCTACYT TTCCTCCTTC TTCCTAAAAA ATTAATGAAT 

CCAATACATT AATGCCAAAA CCCTTGGGTT TTATCAATAT TTCTGTTAAA AAGTATTATC 

CAGAACTGGA CATAATACTA CATAATAATA CATAACAACC CCTTCATCTG GATGCAAACA 

TCTATTAATA TAGCTTAAGA TCACTTTCAC TTTACAGAAG CAACATCCTG TTGATGTTAT 

TTTGATGTTT GGACCAATCC AA

(2) INFORMATION FOR SEQ ID NO:257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

GNGGNNNNNN NNNCAATTCG ACTCNGTTCC CNTGGTANCC GGTCGACATG GCCGCGGGAT 
TACCGGTTGT NNCTGGGGGT GTATGGGGGA CTATGACCGC TTGTAGCTGG GGGTGTATGG 
GGGACTATGA CCGCTTGTAG MTGGKGGTGT ATGGGGGACT ATGACCGCTT GTCGGGTGGT 
CGGATAAACC GACGCAAGGG ACGTGATCGA AGCTGCGTTC CCGCTCTTTC GCATCGGTAG 
GGATCATGGA CAGCAATATC CGCATTCGYC TGAAGGCGTT CGACCATCGC GTGCTCGATC
AGGCGACCGG CGACATCGCC GACACCGCAC GCCGTACCGG CGCGCTCATC CGCGGTCCGA 
TCCCGCTTCC CACGCGCATC GAGAAGTTCA CGGTCAACCG TGGCCCGCAC GTCGACAAGA 
AGTCGCGCGA GCAGTTCGAG GTGCGTACCT ACAAGCGGTC A 

(2) INFORMATION FOR SEQ ID NO: 258:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258: 

TGACCGCTTG TAGCTGGGGG TGTATGGGGG ACTACGACCG CTTGTAGCTG GGGGTGTATG 
GGGGACTATG ACCGCTTGTA GCTGGGGGTG TATGGGGGAC TATGACCGCT TGTAGCTGGG 
GGTGTATGGG GGACTAGGAC CGCTTGTAGC TGGGGGTGTA TGGGGGACTA TGACCGCTTG 
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TAGCTGGGGG TGTATGGGGG ACTACGACCG CTTGTAGCTG GGGGTGTATG GGGGAGTATG 
ACCGCTTGTA NCTGGGGGTG TATGGGGGAC TATGACCGCT TGTGCTGCCT GGGGGATGGG 
AGGAGAGTTG TGGTTGGGGA AAAAAAAAAA AA 

(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 291 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 



TACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTGT 
GACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTGT 
GACCGCTTGT GACCGCTTGT NACNGGGGGT GTCTGGGGGA CTATGANNGA NTGTNACTGG 
GGGTGTCTGG GGGNCTATGA NNGANTGTNA CNGGGGGTGT CTGGGGGACT ATGANNGACT 
GTGCNNCCTG GGGGATCNGA GGAGANTNGN GGNTAGNGAT GGTTNGGGAN A 

(2) INFORMATION FOR SEQ ID NO: 260:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260:

TAAGAGGGTA CTGGTTAAAA TACAGGAAAT CTGGGGTAAT GAGGCAGAGA ACCAGGATAC
TTTGAGGTCA GGGATGAAAA CTAGAATTTT TTTCTTTTTT TTTGCCTGAG AAACTTGCTG
CTCTGAAGAG GCCCATGTAT TAATTGCTTT GATCTTCCTT TTCTTACAGC CCTTTCAAGG 
GCAGAGCCCT CCTTATCCTG AAGGAATCTT ATCCTTAGCT ATAGTATGTA CCCTCTTA 

(2) INFORMATION FOR SEQ ID NO: 261: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 746 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:261:

TTGGGCACCT TCAATATCAA TAGCTAACAT TTATTGAGTG TTTATCGTAT CATAAAACAC 60
TGTTCTAAGC CTTTAAACGT ACTAATTCAT TTAATGCTCA TAATCACTTT AGAAGGTGGG 120
TACTAGTATT AGTCTCATTT ACAGATGCAA CATGCAGGCA CAGAGAGGTT AATTAACTTG 180
CCCAAGGTAA CACAGCTAAG AAATAGAAAA AATATTGAAT CTGGAAAGTT GGGCTTCTGG 240
GTAACCCACA GAGTCTTCAA TGAGCCTGGG GCCTCACTCA GTTTGCTTTT ACAAAGCGAA 300
TGAGTAACAT CACTTAATTC AGTGAGTAGG CCAAATGGAG GTCAGCTACG AGTTTCTGCT 360
GTTCTTGCAG TGGACTGACA GATGTTTACA ACGTCTGGCC ATCAGTWAAT GGACTGATTA 420
TCATTGGGAW GTGGGTGGGC TGAATGTTGG CCAGTGAAGT TTATTCAWGC CATATTTTTA 480
TGTTTAGGAT GACTTTTGGC TGGTCCTAGG GCAAGCTCTG TCTGSCACGG AACACAGAAT 540
WACACAGGGA CCCCCTCAAT TTCTGGTGTG GCTAGAACCA TGAACCACTG GTTGGGGGAA 600
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CAAGCGGTCA AAACCTAAGT GCGGCCGGCT GGCAGGGTCC ACCCATATGG GGAAAACTCC 
CNACGCGTTT GGAATGCCTN AGCTNGAATT ATTCTAANAG TTGTCCNCNT AAAATTAGCC 
TGGGCGTTAA TCANGGGTCN NAAGCC 

(2) INFORMATION FOR SEQ ID NO: 262:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 588 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 

TGACCGCTTG TCATCTCACA TGGGGTCCTG CACGCTTTTG CCTTTGTAGG AAACCTGACA 
TTTGTCTGTT TCTTCTTTCT CTTTTCCTTC CCATATCCTC CTAATTTACG TTTGACTTGT 
TTGCTGAGGA GGCAGGAGCT AGAGACTGCT GTGAGCTCAT AGGGGTGGGA AGTTTATCCT
TCAAGTCCCG CCCACTCATC ACTGCTTCTC ACCTTCCCCT GACCAGGCTT ACAAGTGGGT 
TCTTGCCTGC TTTCCCTTTG GACCCAACAA <7CCCCTGTAA TGAGTGTGCA TGACTCTGAC 
AGCTGTGGAC TCAGGGTCCT TGGCTACAGC TGCCATGTAA AATATCTCAT CCAGTTCTCG 
CAAATTGTTA AAATAACCAC ATTTCTTAGA TTCCAGTACC CAAATCATGT CTTTACGAAC 
TGCTCCTCAC ACCCAGAAGT GGCACAATAA TTCTTGGGGA ATTATTACTT TTTTTTTTCT 
CTCTNTTNNC GNNNGNNNNG GNNNGNCCAG GAATTACCAC NTTGGAAGAC CTGGCCNGAA 
TTTATTATAN AGGGGAGCCG ATTNTTTTTC CTAACACAAA GCGGGTCA

(2) INFORMATION FOR SEQ ID NO:263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

TTTTTTTTTT TTTGGCCTGA GCAAGTGAAA TTATGAAATT TCCATATACT CAAAAGAGTA 
AGACTGCAAA AAGATTAAAT GTAAAAGTTG TCTTGTATAC AGTAATGTTT AAGATACCTA 
TTANATTTAT AAATGGAAAA TTAGGGCATT TGGATATACA AGTTGAAAAT TCAGGAGTGA 
GGTTGGGCTG GCTGGGTATA TACTGAAAAC TGTCAGTACA CAGATGACAT CTAAAACCAC 
AAATCTGGTT TTATTTTAGC AGTGATATGT GTCACTCCCA CAAAAGCCTT CCCAATTGGC 
CTCAGCATAC ACAACAAGTC ACCTCCCCAC AGCCCTCTAC ACATAAACAA ATTCCTTAGT 
TTAGTTCAGG AGGAAATGCG CCCTTTTCCT TCCGCTCTAG GTGACCGCAA GGCCCAGTTC 
TCGTCACCAA GATGTTAAGG GAAGTCTGCC AAAGAGGCAT CTGAAAGGAA ATAAGGGGAA 
TGGGAGTGAC CACAAAGGAA AGCCAAGGAJ^ AAACTTTGGA GACCGTTTCT AGANCCCTGG 
CATTTCACAA CAAAACTCNG GAACAAACCT TGTCTCATCA ATCATTTAAG CCCTTCGTTT 
GGANNAGACT TTCTGAACTG GGCGCTGAAC ATAANCCTCA TTGAATGTCT TCACAGTCTC 
CCAGCTGAAG GCACACCTTG GGCCAGAAGG GGAATCTTCC AGGTCCTCAA NACAGGGCTC 
GCCCTTTGNC 

(2) INFORMATION FOR SEQ ID NO: 264:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 







(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:264: 



TTAGGTCATT 60
TATTTCTCCC 120
AATTTTTTAG 180
TTTGTGGTTC 240
AATATCATAT 300
TTCCATCCTA 360
TGGGAGGTTT 420
TAAATNTCTG 480
GAATCTGAAG 540
ATTAAGTCTT 600
GTTAAANTTT 660
TCNCC 715



(2) INFORMATION FOR SEQ ID NO: 265:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 152 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265:

"^TTTTTTTTT TTTCCCAACA CAAAGCACCA TTATCTTTCC TCACAATTTT CAACATAGTT 60
TGATTCCCAT GAAGAGGTTA TGATTTCTAA AGAAAACATG GCTACTATAC TATCAATCAG 120
GGTTAAATCT TTTTTTTTTG AGACGGAGTT TA 152

(2) INFORMATION FOR SEQ ID NO: 266:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 

TAAACTCCGT CCCCTTCTTA ATCAATATGG AGGCTACCCA CTCCACATTA CCTTCTTTTC 60
AAGGGACTGT TTCCGTAACT GTTGTGGGTA TTCACGACCA GGCTTCTAAA CCTCTTAAAA 120
CTCCCCAATT CTGGTGCCAA CTTGGACAAC ATGCTTTTTT TTTTTTTTTT TTTTTTTTTN 180
GAGACGGAGT TTA 193

(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 460 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:

TGTTGCGATC CCTTAAGCAT GGGTGCTATT AAAAAAATGG TGGAGAAGAA AATACCTGGA 
ATTTACGTCT TATCTTTAGA GATTGGGAAG ACCCTGATGG AGGACGTGGA GAACAGCTTC 
TTCTTGAATG TCAATTCCCA AGTAACAACA GTGTGTCAGG CACTTGCTAA GGATCCTAAA 
TTGCAGCAAG GCTACAATGC TATGGGATTC TCCCAGGGAG GCCAATTTCT GAGGGCAGTG 
GCTCAGAGAT GCCCTTCACC TCCCATGATC AATCTGATCT CGGTTGGGGG ACAACATCAA 
GGTGTTTTTG GACTCCCTCG ATGCCCAGGA GAGAGCTCTC ACATCTGTGA CTTCATCCGA 
AAAACACTGA ATGCTGGGGC GTACTCCAAA GTTGTTCAGG AACGCCTCGT GCAAGCCGAA 
TACTGGCATG ACCCATAAAA GGAGGATGTG GATCGCAACA 

(2) INFORMATION FOR SEQ ID NO:268:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 533 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:268:

TGTTGCGATC CGTTGATAGA ATAGCGACGT GGTAATGAGT GCATGGCACG CCTCCGACTT 
ACCTTCGCCC GTGGGGACCC CGAGTACGTC TACGGCGTCG TCACTTAGAG TACCCTCTGG 
ACGCCCGGGC GCGTTCGATT TACCGGAAGC GCGAGCTGCA GTGGGCTTGC GCCCCCGGCC 
AAATTCTTTG GGGGGTTTAA GGCCGCGGGG AATTTGAGGT ATCTCTATCA GTATGTAGCC 
AAGTTGGAAC AGTCGCCATT CCCGAAATCG CTTTCTTTGA ATCCGCACCG CCTCCAGCAT 
TGCCTCATTC ATCAACCTGA AGGCACGCAT AAGTGACGGT TGTGTCTTCA GCAGCTCCAC 
TCCATAACTA GCGCGCTCGA CCTCGTCTTC GTACGCGCCA GGTCCGTGCG TGCGAATTCC 
CAACTCCGGT GAGTTGCGCA TTTCAAGTTN CGAAACTGTT CGCCTCCACN ATTTGGCATG 
TTCACGCATG ACACGGAATA AACTCGTCCA GTACCGGGAA TGGGATCGCA ACA 

(2) INFORMATION FOR SEQ ID NO:269:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 50 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:269:
TTTTTTTTTT TTCGCCTGAA TTAGCTACAG ATCCTCCTCA CAAGCGGTCA 
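
As an illustrative aside that is not part of the original listing, the LENGTH field of an entry can be cross-checked against the bases actually printed for it. The Python sketch below is a minimal example under stated assumptions: the helper name count_bases is hypothetical, and it simply counts IUPAC nucleotide letters while ignoring spaces and position numbers. It is applied here to the 50 base pair entry immediately above.

```python
import re

# Hypothetical helper (not from the patent): count IUPAC nucleotide letters
# in a block of listing text, ignoring spaces and position numbers.
IUPAC_NUCLEOTIDE = re.compile(r"[ACGTUNRYKMSWBDHV]")


def count_bases(listing_text: str) -> int:
    """Return the number of nucleotide letters found in listing_text."""
    return len(IUPAC_NUCLEOTIDE.findall(listing_text.upper()))


# Sequence as printed for the 50 base pair entry above.
seq_269 = "TTTTTTTTTT TTCGCCTGAA TTAGCTACAG ATCCTCCTCA CAAGCGGTCA"
declared_length = 50  # value of the "(A) LENGTH" field for that entry

assert count_bases(seq_269) == declared_length
```

A mismatch between the count and the declared length would suggest that bases or whole columns were lost when the listing was reproduced.
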
(2) INFORMATION FOR SEQ ID NO:270:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:270:

TGTTGCGATC CAAATAACCC ACCAGCTTCT TGCACACTTC GCAGAAGCCA CCGTCCTTTG 
GCTGAGTCAC GTGAACGGTC AGTGCAAGCA GCCGCGTGCC AGAGCAGAGG TGCAGCATGC 
TGCACACCAG CTCAGGGCTG ACCTCCTCCA GCAGGATGGA CAGGATGGAG CTGCCGTACG 
TGTCCACCAC CTCCTGGCAC TCTTCCGACA GGGACTTCGG CAGCTTCGAG CACATTTTGT 
CAAAAGCGTC GAGTATTTCT TTCTCAGTCT TGTTGTTGTC AATCAGCTTG GTCACCTCCT 
TCACCAGGAA TTCACACACC TCACAGTAAA CATCAGACTT TGCTGGGACC TCGTGCTTCT 
TAATGGGCTC CACCAGTTCC AGGGCAGGGA TGACATTCTT GGAGGCCACT TTGGCGGGGA 
CCAGAGTCTG CATGGGCATC TCTTTCACCT CATCACAGAA CCCAACCAGC GCACAGATCT 
CCTTGGGTTG CATGTGCATC ATCATCTGGG ATCGCAACA 

(2) INFORMATION FOR SEQ ID NO: 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:271:



TTTTTTTTTT TTCGGGCGGC GACCGGACGT GCACTCCTCC AGTAGCGGCT GCACGTCGTG 

CCAATGGCCC GCTATGAGGA GGTGAGCGTG TCCGGCTTCG AGGAGTTCCA CCGGGCCGTG 

GAACAGCACA ATGGCAAGAC CATTTTCGCC TACTTTACGG GTTCTAAGGA CGCCGGGGGG 

AAAAGCTGGT GCCCCGACTG CGTGCAGGCT GAACCAGTCG TACGAGAGGG GCTGAAGCAC 

ATTAGTGAAG GATGTGTGTT CATCTACTGC CAAGTAGGAG AAGAGCCTTA TTGGAAAGAT 

CCAAATAATG ACTTCAGAAA AAACTTGAAA GTAACAGCAG TGCCTACACT ACTTAAGTAT 

GGAACACCTC AAAAACTGGT AGAATCTGAG TGTCTTCAGG CCAACCTGGT GGAAATGTTG 
TTCTCTGAAG ATTAAGATTT TAGGATGGCA ATCAAGA 

(2) INFORMATION FOR SEQ ID NO:272:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 



TTTTTTTTTT TTGGGCAACA ACCTGAATAC CTTTTCAAGG CTCTGGCTTG GGCTCAAGCC 
CGCAGGGGAA ATGCAACTGG CCAGGTCACA GGGCAATCAA GA 

(2) INFORMATION FOR SEQ ID NO:273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 

TTTTTTTTTT TTGGCAATCA ACAGGTTTAA GTCTTCGGCC GAAGTTAATC TCGTGTTTTT 
GGCAATCAAC AGGTTTAAGT CTTCGGCCGA AGTTAATCTC GTGTTTTTGG CAATCAACAG 
GTTTAAGTCT TCGGCCGAAG TTAATCTCGT GTTTTTGGCA ATCAACAGGT TTAAGTCTTC 
GGCCGAAGTT AATCTCGTGT TTTTGGCAAT CAACAGGTTT AAGTCTTCGG CCGAAGTTAA 
TCTCGTGTTT TTGGCAATCA ACAGGTTTAA GTCTTCGGCC GAAGTTAATC TCGTGTTTTT 
GGCAATCAAG AGGTTTAAGT CTTCGGCCGA AGTTAATCTC GTGTTTTTGG CAATCAACAG 
GTTTAAGTCT TCGGCCGAAN TTAATCTCGT rTTTTTr-or.n nrr,^. 

GGCCGAAGTT AATCTCGTGT TT^JgSSJ 0^ '^'^ ATCAACAGGT TTAANTCTTC 

(2) INFORMATION FOR SEQ ID NO:274:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 461 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:274:

?G™cI S?c°cSSS cL"?c^ ^—-C -C 

TCCCTGGGAT GCAAGGcSg Sc^cJJJ^ ^^^^^^^A CCATGATCAA GTGGGCTTCA 
ACAGAACCAA AG^SS^^ Sc^SaJJJ r^J^^^l ^J^^^ CATCACATAA 
AATTCAACAG CCCTTCATGC TAAACACTCT TAa^rJIJ^^ TGCAGAAAAG GCCTTGGACA 
CAAAATAATA AGAGCTATTT l^C^rr r^lT^^ GATATTGATG GAATGTATCT 
ACTGGAAGCA TTCCC^lol ^CT^lc ^^r..^'' ^^^^^^^^^^ ATGGGCAAAG 
ATTCAACATA GTATTGGA^S ^JJJJcCCaS SSc^^JSJ -CCCTCTCT CACCGCTCCT 

(2) INFORMATION FOR SEQ ID NO:275:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:275:

SJo^S oc^JSSSIS ™- ™- 

sss^ ™s s^sss? 

NCMTAAGCA G^^JSS GA^Z ^ISSJ"™ °™°S'^CCC CATOAGTGAA 
CGCTTTCTNT CGTGtJSJI tJS^^Sc SJ^Sg ^JSc^JSS! C™AN« 
ATTTGGGAGi ACTCCCCCCN CGTTO<-»Tr^^ ^^^5:™ CGGCCCCCTG CMGGTCCACC 

ACCNGNCTTG cSJS^S cS^gSS f^^^^^^^™ 

NCCGCCNCCC NAATTCCCAC rrua Zi^rn 7. a CTGNNGTNAA AATNNGTTTT 

ANGCCCNGNG G^^TcScN^ S^JSS rS^'^''^^ AAGGCCTTCN NAAGTGTTTA 

TCGCGCCCTG GNCNCGcSn G^J^SSJ^ SSSSS^^ CNCTTNNI^T TNCGNGTTGG 

NNNCTNTTCC TNNNACtSc TNGcSSrr CCTNGNTCNN NGGCNCNTCN 

TNTGTNNCC ^""^^^^^ TNGCCTNTCC NCNCCGNGGN NCANNGCACA TTNCNCNNAC 



(2) INFORMATION FOR SEQ ID NO:276:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 339 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:276:

TGACCTGACA TGTAGTAGAT ACTTAATAAA TATTTGTGGA ATGAATGGAT GAAGTGGAGT 60

TACAGAGAAA AATAGAAAAG TACAAATTGT TGTCAGTGTT TTGAAGGAAA ATTATGATCT 120

TTCCCAAAGT TCTGACTTCA TTCTAAGACA GGGTTAGTAT CTCCATACAT AATTTTACTT 180

GCTTTTGAAA ATCAAATGAG ATAATCTATT TAGATTGATA ATTTATTTAG ACTGGCTATA 240

AACTATTAAG TGCTAGCAAA TATACATTTT AATCTGATTT TCCACCTCTT GTGATATAGC 300

TATGTAGGTG TTGACTTTAA TGGATGTCAG GTCAATCCC 339

(2) INFORMATION FOR SEQ ID NO:277:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 664 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:277:

TGACCTGACA TCCATAACAA AATCTTTCTC CATTATATTC TTCTAGGGGA ATTTCTTGAA 60

AAGCATCCAA AGGAAACAAA TGATGGTAAG ACCGTGCCAA GTGGGGAGCA GACACCAAAG 120

TAAGACCACA GATTTTACAT TCAACAGGTA GCTCACAGTA CTTTGCCCGA CACTGTGGGC 180

AGAAATAGCC TCCTAATGTA AGCCCTGGCT CAGTATTGCC ATCCAAATGC GCCATGCTGA 240

AAGAGGGTTT TGCATCCTGG TCAGATNAAG AAGCAATGGT GTGCTGAGGA AATCCCATAC 300

GAATAAGTGA GCATTCAGAA CTTGAGCTAG CAGGAGGAGG ACTAAGATGA TGTGTGAGCA 360

ACTCTTTGTA ATGGCTTTCA TCTAAAATAA CATGGTACGT GCCACCAGTT TCACGAGCAA 420

GTACAGTGCA AACGCGAACT TCTGCAGACA ATCCAATAAC AGATACTCTA ATTTTAGCTG 480

CCTTTAGGGT CTTGATTAAA TCATAAATAT TAGATGGATC GCAAGTTGTA AGGNTGCTAA 540

AAGATGATTA GTACTTCTCG ACTTGTATGT CCAGGCATGT TGTTTTAAAN TCTGCCTTAG 600

NCCCTGCTTA GGGGAATTTT TAAAGAAGAT GGCTCTCCAT GTTCANGGTC AATCACNAAT 660

664

(2) INFORMATION FOR SEQ ID NO:278:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:278: 

TGACCTGACA TTGAGGAAGA GCACACACCT CTGAAATTCC TTAGGTTCAG AAGGGCATTT 60

GACACAGAGT GGGCCTCTGA TAATTCATGA AATGCATTCT GAAGTCATCC AGAATGGAGG 120

CTGCAATCTG CTGTGCTTTG GGGGTTGCCT CACTGTGCTC CTGGATATCA CACAAAAGCT 180

GCAATCCTTC TTCTTCAACT AACATTTTGC AGTATTTGCT GGGATTTTTA CTGCAGACAT 240

GATACATAGC CCATAGTGCC CAGAGCTGAA CCTCTGGTTG AGAGAAGTTG CCAAGGAGCG 300

GGAAAAATGT CTTGAAAGAT CTATAGGTCA CCAATGCTGT CATCTTACAA CTTGAACTTG 360

GCCAATTCTG TATGGTTGCA TGCAGATCTT GGAGAAGAGT ACGCCTCTGG AAGTCACGGG 420

ATATCCAAAN CTGTCTGTCA GATGTCAGGT CA 452

(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:279:

TTTTTTTTTT TTCGGCAAGG CAAATTTACT TCTGCAAAAG GGTGCTGCTT GCACTTTTGG 
CCACTGCGAG AGCACACCAA ACAAAGTAGG GAAGGGGTTT TTATCCCTAA CGCGGTTATT 
CCCTGGTTCT GTGTCGTGTC CCCATTGGCT GGAGTCAGAC TGCACAATCT ACACTGACCC 
AACTGGCTAG TGTTTAAAAT TGAATATGAA TAATTAGGTA GGAAGGGGGA GGCTGTTTGT 
TACGGTACAA GACGTGTTTG GGCATGTCAG GTCA 

(2) INFORMATION FOR SEQ ID NO:280:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 272 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:280:

TACCTGACAT GGAGAAATAA CTTGTAGTAT TTTGCGTGCA ATGGAATACT ATATGAGGGT 
GAAAATGAAT GAACTAGCAA TGCGTGTATC AACATGAATA AATCCCCAAA ACATAATAAT 
GTTGAATGGA AAAGGTGAGT TTCAGAAGGA TATATATGCC CTCTAAATCC ATTTATGTAA 
ACCTTTAAAA AACTACATTA TTTATGGTCA TAAGTCCATC CAGAAAATAT TTAAAAACCT 
ACATGGGATT GATAACTACT GATGTCAGGT CA 

(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:281:

TTTTTTTTTT TTGGCCAATA GCATGATTTA AACATTGGAA AAAGTCAAAT GAGCAATGCG 
AATTTTTATG TTCTCTTGAA TAATCAAAAG AGTAGGCAAC ATTGGTTCCT CATTCTTGAA 
TAGCATTAAT CAGAAAATAT TGCATAGCCT CTAGCCTCCT TAGAGTAGGT GTGCTCTCTC 
AAATATATCA TAGTCCCACA GTTTATTTCA TGTATATTTT CTGCCTGAAT CACATAGACA 
TTTGAATTTG CAACGCCTGA TGTAAATATA TAAATTCTTA CCAATCAGAA ACATAGCAAG 
AAATTCAGGG ACTTGGTCAT YATCAGGGTA TGACAGCANA TCCCTGTARA AACACTGATA 
CACACTCACA CACGTATGCA ACGTGGAGAT GTCGCYTTWW KKKTWYWCWM RMRYCRWCGN 420

AATCACTTAN N 431

(2) INFORMATION FOR SEQ ID NO:282:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 

ATTCGATTCG ATGCTTGAGC CCAGGAGTTC AAGACTGCAG TGAGCCACTG CACTTCAGGC 60
TGGACAACAG AGCGAGTCCC TGTGCCAAAA AAAAAAAA 98

(2) INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:283:

TTTTTTTTTT TTCGCAAGCA CGTGCACTTT ATTGAATGAC ACTGTAGACA GGTGTGTGGG 60

TATAAACTGC TGTATCTAGG GGCAGGACCA AGGGGGCAGG GGCAACAGCC CCAGCGTGCA 120

GGGCCASCAT TGCACAGTGG ASTGCAAAGG TTGCAGGCTA TGGGCGGCTA CTAVTAACCC 180

CGTTTTTCCT GTATTATCTG TAACATAATA TGGTAGACTG TCACAGAGCC GAATWCCART 240

HACASGATGA ATCCAAWGGT CAYGAGGATG CCCASAATCA GGGCCCASAT STTCAGGCAC 300

TTGGCGGTGG GGGCATASGC CTGKGCCCCG GTCACGTCSC CAACCWTCTY CCTGTCCCTA 360

CMCTTGAWTC CNCNCCTTNN NNTNCCNTNA TNTGCCCGCC CNCCTCCTNG NGTCAACCNG 420

NATCTGCACT ANCTCCCTCN CCCCTTNTGG ANTCTCNTCC TTCAANTAAN NTTATCCTTN 480

ACNCCCCCCT CNCCTTTCCC CTNCCNCCCN TNATCCCNGN NCCNCTATCA NTCNTNCCCT 540

CNCTNTNCTN CNNATCGTTC CNCCTNNTAA CTACNCTTTN NACNANNCCT CACTNATNCC 600

NGNNANTTCT TTCCTTCCCT CCCNACGCNN TGCGTGCGCC CGTCTNGCCT NNNCTNCGNA 660

CCCNNACTTT ATTTACCTTT NCACCCTAGC NCTCTACTTN ACCCANCCNC TCCTACCTCC 720

NGGNCCACCC NNCCCTNATC NCTNNCTCTN TCNNCTCNTT CCCC 764

(2) INFORMATION FOR SEQ ID NO: 2 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:284: 

CAAGTGTAGG CACAGTGATG AAAGCCTGGA GCAAACACAA TCTGTGGGTA ATTAACGTTT 60

ATTTCTCCCC TTCCAGGAAC GTCTTGCATG GATGATCAAA GATCAGCTCC TGGTCAACAT 120

AAATAAGCTA GTTTAAGATA CGTTCCCCTA CACTTGA 157

(2) INFORMATION FOR SEQ ID NO:285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:285:

ATTCGATTGT ACTCAGACAA CAATATGCTA AGTGGAAGAA GTCAGTCACA AAAGACCACA 60

TACTGTATGA CTTCATTTAC ATTAAGTGTC CAGAATAGGC AAATCCGTAG AGACAGAAAG 120

TAGATGAGCA GCTGCCTAGG TCTGAGTACA 150

(2) INFORMATION FOR SEQ ID NO:286:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 219 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

ATTCGATTTT TTTTTTTTTG GCCATGATGA AATTCTTACT CCCTCAGATT TTTTGTCTGG 
ATAAATGCAA GTCTCACCAC CAGATGTGAA ATTACAGTAA ACTTTGAAGG AATCTCCTGA 
GCAACCTTGG TTAGGATCAA TCCAATATTC ACCATCTGGG AAGTCAGGAT GGCTGAGTTG 
CAGGTCTTTA CAAGTTCGGG CTGGATTGGT CTGAGTACA 

(2) INFORMATION FOR SEQ ID NO:287:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 196 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:287:

ATTCGATTCT TGAGGCTACC AGGAGCTAGG AGAAGAGGCA TGGAACAAAT TTTCCCTCAT 
ATCCATACTC AGAAGGAACC AACCCTGCTG ACACCTTAAT TTCAGCTTCT GGCCTCTAGA 
ACTGTGAGAG AGTACATTTC TCTTGGTTTA AGCCAAGAGA ATCTGTCTTT TGGTACTTTA 
TATCATAGCC TCAAGA 

(2) INFORMATION FOR SEQ ID NO:288:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:288:

ATTCGATTTC AGTCCAGTCC CAGAACCCAC ATTGTCAATT ACTACTCTGT ARAAGATTCA 
TTTGTTGAAA TTCATTGAGT AAAACATTTA TGATCCCTTA ATATATGCCA ATTACCATGC 
TAGGTACTGA AGATTCAAGT GACCGAGATG CTAGCCCTTG GGTTCAAGTG ATCCCTCTCC 
CAGAGTGCAC TGGACTGAA 



(2) INFORMATION FOR SEQ ID NO:289:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:289:

ATTCGATTCT TGAGGCTACA AACCTGTACA GTATGTTACT CTACTGAATA CTGTAGGCAA 60

TAGTAATACA GAAGCAAGTA TCTGTATATG TAAACATTAA AAAGGTACAG TGAAACTTCA 120

GTATTATAAT CTTAGGGACC ACCATTATAT ATGTGGTCCA TCATTGGCCA AAAAAAAAAA 180

AA 182

(2) INFORMATION FOR SEQ ID NO:290:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1646 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

GGCACGAGGA GAAATGTAAT TCCATATTTT ATTTGAAACT TATTCCATAT TTTAATTGGA 60

TATTGAGTGA TTGGGTTATC AAACACCCAC AAACTTTAAT TTTGTTAAAT TTATATGGCT 120

TTGAAATAGA AGTATAAGTT GCTACCATTT TTTGATAACA TTGAAAGATA GTATTTTACC 180

ATCTTTAATC ATCTTGGAAA ATACAAGTCC TGTGAACAAC CACTCTTTCA CCTAGCAGCA 240

TGAGGCCAAA AGTAAAGGCT TTAAATTATA ACATATGGGA TTCTTAGTAG TATGTTTTTT 300

TCTTGAAACT CAGTGGGTCT ATCTAACCTT ACTATCTCCT CACTCTTTCT CTAAGACTAA 360

ACTCTAGGCT CTTAAAAATC TGCCCACACC AATCTTAGAA GCTCTGAAAA GAATTTGTCT 420

TTAAATATCT TTTAATAGTA ACATGTATTT TATGGACCAA ATTGACATTT TCGACTATTT 480

TTTCCAAAAA AGTCAGGTGA ATTTCAGCAC ACTGAGTTGG GAATTTCTTA TCCCAGAAGA 540

CCAACCAATT TCATATTTAT TTAAGATTGA TTCCATACTC CGTTTTCAAG GAGAATCCCT 600

GCAGTCTCCT TAAAGGTAGA ACAAATACTT TCTATTTTTT TTTCACCATT GTGGGATTGG 660

ACTTTAAGAG GTGACTCTAA AAAAACAGAG AACAAATATG TCTCAGTTGT ATTAAGCACG 720

GACCCATATT ATCATATTCA CTTAAAAAAA TGATTTCCTG TGCACCTTTT GGCAACTTCT 780

CTTTTCAATG TAGGGAAAAA CTTAGTCACC CTGAAAACCC ACAAAATAAA TAAAACTTGT 840

AGATGTGGGC AGAAGGTTTG GGGGTGGACA TTGTATGTGT TTAAATTAAA CCCTGTATCA 900

CTGAGAAGCT GTTGTATGGG TCAGAGAAAA TGAATGCTTA GAAGCTGTTC ACATCTTCAA 960

GAGCAGAAGC AAACCACATG TCTCAGCTAT ATTATTATTT ATTTTTTATG CATAAAGTGA 1020

ATCATTTCTT CTGTATTAAT TTCCAAAGGG TTTTACCCTC TATTTAAATG CTTTGAAAAA 1080

CAGTGCATTG ACAATGGGTT GATATTTTTC TTTAAAAGAA AAATATAATT ATGAAAGCCA 1140

AGATAATCTG AAGCCTGTTT TATTTTAAAA CTTTTTATGT TCTGTGGTTG ATGTTGTTTG 1200

TTTGTTTGTT TCTATTTTGT TGGTTTTTTA CTTTGTTTTT TGTTTTGTTT TGTTTTGTTT 1260

KGCATACTAC ATGCAGTTCT TTAACCAATG TCTGTTTGGC TAATGTAATT AAAGTTGTTA 1320

ATTTATATGA GTGCATTTCA ACTATGTCAA TGGTTTCTTA ATATTTATTG TGTAGAAGTA 1380

CTGGTAATTT TTTTATTTAC AATATGTTTA AAGAGATAAC AGTTTGATAT GTTTTCATGT 1440

GTTTATAGCA GAAGTTATTT ATTTCTATGG CATTCCAGCG GATATTTTGG TGTTTGCGAG 1500

GCATGCAGTC AATATTTTGT ACAGTTAGTG GACAGTATTC AGCAACGCCT GATAGCTTCT 1560

TTGGCCTTAT GTTAAATAAA AAGACCTGTT TGGGATGTAT TTTTTATTTT TAAAAAAAAA 1620

AAAAAAAAAA AAAAAAAAAA AAAAAA 1646

(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:291:

CACATGAGTA
TTGTTPTGnA 180
AGATPTTGAn 300
CCAPAAPATA 360
GCTTTTTTPT 420
AGTGGTGTGA 480
TTGGPPATTA 540
TCTGGAATAT 600
CATTGTAPGG 660
CGTPTGTPPA 720
GPAGTPPTPT 780
TPPTTTPTHP 840
TGGAPGTGGT 900
TTGPTPTTGP 960
TCPPAAGPGT 1020
GTATAPTPPT 1140
CTGTPPAPPP 1200
TTCAGPPAGA 1260
ATAACAAAAT 1380
CCCGAAGAAG 1440
CGGACAGGAR 1500
GCACACGGTG 1560
AATGTGATAA 1620
AGAAAAGGCA 1680
ATGCTTCTAA 1740
TTATTACTTT 1800
C 1851



(2) INFORMATION FOR SEQ ID NO:292:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1851 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:292:



GTCAGGTTTT CTGGGAATCC CACATGAGTA 60

CATAAATCTT CTAGCTCTGG CTGGCTGTTT 120

TCTGATGTCA GCTTTAAGTC TTGTTCTGGA 180

TGTTTTTCTT TGTAGTCAGA AAGTAACTGG 240

TACTCTCTGG CCGTCTTTCC AGATCTTGAG 300

GGCTGACTAT ACTTGCTGAT CCACAACATA 360

TATCCAGCGC ATTTAAATTC GCTTTTTTCT 420

GCTCATGTAT ACCAAGTAGC AGTGGTGTGA 480

CACCGTATAA GAGCAGTGCT TTGGCCATTA 540

GAGTGGTATT TCCATACTCA TCTGGAATAT 600

ACGCACATTC ATCTTCCTGG CATTGTACGG 660

CAAGGACATT AAGTTGACAT CGTCTGTCCA 720

TGGCAGAGGC CAGATGTAGA GCAGTCCTCT 780

TTTGCTTGTC CCTCTTGTTC ACATCCGTGT CCCTGAGCAT GACGATGAGA TCCTTTCTGG 840

GGACTTTACC CCACCAGGCA GCTCTGTGGA GCTTGTCCAG ATCTTCTCCA TGGACGTGGT 900

ACCTGGGATC CATGAAGGCG CTGTCATCGT AGTCTCCCCA AGCGACCACG TTGCTCTTGC 960

CGCTCCCCTG CAGCAGGGGA AGCAGTGGCA GCACCACTTG CACCTCTTGC TCCCAAGCGT 1020

CTTCACAGAG GAGTCGTTGT GGTCTCCAGA AGTGCCCACG TTGCTCTTGC CGCTCCCCCT 1080

GTCCATCCAG GGAGGAAGAA ATGCAGGAAA TGAAAGATGC ATGCACGATG GTATACTCCT 1140

CAGCCATCAA ACTTCTGGAC AGCAGGTCAC TTCCAGCAAG GTGGAGAAAG CTGTCCACCC 1200

ACAGAGGATG AGATCCAGAA ACCACAATAT CCATTCACAA ACAAACACTT TTCAGCCAGA 1260

CACAGGTACT GAAATCATGT CATCTGCGGC AACATGGTGG AACCTACCCA ATCACACATC 1320

AAGAGATGAA GACACTGCAG TATATCTGCA CAACGTAATA CTCTTCATCC ATAACAAAAT 1380

AATATAATTT TCCTCTGGAG CCATATGGAT GAACTATGAA GGAAGAACTC CCCGAAGAAG 1440

CCAGTCGCAG AGAAGCCACA CTGAAGCTCT GTCCTCAGCC ATCAGCGCCA CGGACAGGAR 1500

TGTGTTTCTT CCCCAGTGAT GCAGCCTCAA GTTATCCCGA AGCTGCCGCA GCACACGGTG 1560

GCTCCTGAGA AACACCCCAG CTCTTCCGGT CTAACACAGG CAAGTCAATA AATGTGATAA 1620

TCACATAAAC AGAATTAAAA GCAAAGTCAC ATAAGCATCT CAACAGACAC AGAAAAGGCA 1680

TTTGACAAAA TCCAGCATCC TTGTATTTAT TGTTGCAGTT CTCAGAGGAA ATGCTTCTAA 1740

CTTTTCCCCA TTTAGTATTA TGTTGGCTGT GGGCTTGTCA TAGGTGGTTT TTATTACTTT 1800

AAGGTATGTC CCTTCTATGC CTGTTTTGCT GAGGGTTTTA ATTCTCGTGC C 1851

(2) INFORMATION FOR SEQ ID NO:293: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 668 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:293: 

CTTGAGCTTC CAAATAYGGA AGACTGGCCC TTACACASGT CAATGTTAAA ATGAATGCAT 60

TTCAGTATTT TGAAGATAAA ATTRGTAGAT CTATACCTTG TTTTTTGATT CGATATCAGC 120

ACCRTATAAG AGCAGTGCTT TGGCCATTAA TTTATCTTTC ATTRTAGACA GCRTAGTGYA 180

GAGTGGTATT TCCATACTCA TCTGGAATAT TTGGATCAGT GCCATGTTCC AGCAACATTA 240

ACGCACATTC ATCTTCCTGG CATTGTACGG CCTGTCAGTA TTAGACCCAA AAACAAATTA 300

CATATCTTAG GAATTCAAAA TAACATTCCA CAGCTTTCAC CAACTAGTTA TATTTAAAGG 360

AGAAAACTCA TTTTTATGCC ATGTATTGAA ATCAAACCCA CCTCATGCTG ATATAGTTGG 420

CTACTGCATA CCTTTATCAG AGCTGTCCTC TTTTTGTTGT CAAGGACATT AAGTTGACAT 480

CGTCTGTCCA GCAGGAGTTT TACTACTTCT GAATTCCCAT TGGCAGAGGC CAGATGTAGA 540

GCAGTCCTAT GAGAGTGAGA AGACTTTTTA GGAAATTGTA GTGCACTAGC TACAGCCATA 600

GCAATGATTC ATGTAACTGC AAACACTGAA TAGCCTGCTA TTACTCTGCC TTCAAAAAAA 660
AAAAAAAA 

(2) INFORMATION FOR SEQ ID NO:294:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1512 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:294:

GGGTCGCCCA GGGGGSGCGT GGGCTTTCCT CGGGTGGGTG TGGGTTTTCC CTGGGTGGGG 60

TGGGCTGGGC TRGAATCCCC TGCTGGGGTT GGCAGGTTTT GGCTGGGATT GACTTTTYTC 120

TTCAAACAGA TTGGAAACCC GGAGTTACCT GCTAGTTGGT GAAACTGGTT GGTAGACGCG 180

ATCTGTTGGC TACTACTGGC TTCTCCTGGC TGTTAAAAGC AGATGGTGGT TGAGGTTGAT 240

TCCATGCCGG CTGCTTCTTC TGTGAAGAAG CCATTTGGTC TCAGGAGCAA GATGGGCAAG 300

TGGTGCTGCC GTTGCTTCCC CTGCTGCAGG GAGAGCGGCA AGAGCAACGT GGGCACTTCT 360

GGAGACCACG ACGACTCTGC TATGAAGACA CTCAGGAGCA AGATGGGCAA GTGGTGCCGC 420

CACTGCTTCC CCTGCTGCAG GGGGAGTGGC AAGAGCAACG TGGGCGCTTC TGGAGACCAC 480

GACGAYTCTG CTATGAAGAC ACTCAGGAAC AAGATGGGCA AGTGGTGCTG CCACTGCTTC 540

CCCTGCTGCA GGGGGAGCRG CAAGAGCAAG GTGGGCGCTT GGGGAGACTA CGATGACAGT 600

GCCTTCATGG AGCCCAGGTA CCACGTCCGT GGAGAAGATC TGGACAAGCT CCACAGAGCT 660

GCCTGGTGGG GTAAAGTCCC CAGAAAGGAT CTCATCGTCA TGCTCAGGGA CACTGACGTG 720

AACAAGAAGG ACAAGCAAAA GAGGACTGCT CTACATCTGG CCTCTGCCAA TGGGAATTCA 780

GAAGTAGTAA AACTCSTGCT GGACAGACGA TGTCAACTTA ATGTCCTTGA CAACAAAAAG 840

AGGACAGCTC TGAYAAAGGC CGTACAATGC CAGGAAGATG AATGTGCGTT AATGTTGCTG 900

GAACATGGCA CTGATCCAAA TATTCCAGAT GAGTATGGAA ATACCACTCT RCACTAYGCT 960

RTCTAYAATG AAGATAAATT AATGGCCAAA GCACTGCTCT TATAYGGTGC TGATATCGAA 1020

TCAAAAAACA AGGTATAGAT CTACTAATTT TATCTTCAAA ATACTGAAAT GCATTCATTT 1080

TAACATTGAC GTGTGTAAGG GCCAGTCTTC CGTATTTGGA AGCTCAAGCA TAACTTGAAT 1140

GAAAATATTT TGAAATGACC TAATTATCTM AGACTTTATT TTAAATATTG TTATTTTCAA 1200

AGAAGCATTA GAGGGTACAG TTTTTTTTTT TTAAATGCAC TTCTGGTAAA TACTTTTGTT 1260

GAAAACACTG AATTTGTAAA AGGTAATACT TACTATTTTT CAATTTTTCC CTCCTAGGAT 1320

TTTTTTCCCC TAATGAATGT AAGATGGCAA AATTTGCCCT GAAATAGGTT TTACATGAAA 1380

ACTCCAAGAA AAGTTAAACA TGTTTCAGTG AATAGAGATC CTGCTCCTTT GGCAAGTTCC 1440

?S^Sg JJ^^''^^^^^ GAGGTGATGC GCCTGTCAGT GGCAAGGTTT AAGATATTTC 1500

1512 

(2) INFORMATION FOR SEQ ID NO:295:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1853 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:295: 

GGGTCGCCCA GGGGGSGCGT GGGCTTTCCT CGGGTGGGTG TGGGTTTTCC CTGGGTGGGG 60

TGGGCTGGGC TRGAATCCCC TGCTGGGGTT GGCAGGTTTT GGCTGGGATT GACTTTTYTC 120

TTCAAACAGA TTGGAAACCC GGAGTTACCT GCTAGTTGGT GAAACTGGTT GGTAGACGCG 180

ATCTGTTGGC TACTACTGGC TTCTCCTGGC TGTTAAAAGC AGATGGTGGT TGAGGTTGAT 240

TCCATGCCGG CTGCTTCTTC TGTGAAGAAG CCATTTGGTC TCAGGAGCAA GATGGGCAAG 300

TGGTGCTGCC GTTGCTTCCC CTGCTGCAGG GAGAGCGGCA AGAGCAACGT GGGCACTTCT 360

GGAGACCACG ACGACTCTGC TATGAAGACA CTCAGGAGCA AGATGGGCAA GTGGTGCCGC 420

CACTGCTTCC CCTGCTGCAG GGGGAGTGGC AAGAGCAACG TGGGCGCTTC TGGAGACCAC 480

GACGAYTCTG CTATGAAGAC ACTCAGGAAC AAGATGGGCA AGTGGTGCTG CCACTGCTTC 540

CCCTGCTGCA GGGGGAGCRG CAAGAGCAAG GTGGGCGCTT GGGGAGACTA CGATGACAGY 600

rrrlnrtl'^n CCACGTCCRT GGAGAAGATC TGGACAAGCT CCACAGAGCT 660

GCCTGGTGGG GTAAAGTCCC CAGAAAGGAT CTCATCGTCA TGCTCAGGGA CACKGAYGTG 720

JJ^^^^'^*^^ ACAAGCAAAA GAGGACTGCT CTACATCTGG CCTCTGCCAA TGGGAATTCA 780

GAAGTAGTAA AACTCSTGCT GGACAGACGA TGTCAACTTA ATGTCCTTGA CAACAAAAAG 840

AGGACAGCTC TGAYAAAGGC CGTACAATGC CAGGAAGATG AATGTGCGTT AATGTTGCTG 900

GAACATGGCA CTGATCCAAA TATTCCAGAT GAGTATGGAA ATACCACTCT RCACTAYGCT 960

Tr^Itr^t^? AAGATAAATT AATGGCCAAA GCACTGCTCT TATAYGGTGC TGATATCGAA 1020

J^*^^^^^^^^ CACACCACTG YTACTTGGTR TACATGAGCA AAAACAGCAA 1080

TJi^n^l TTTTAATYAA GAAAAAAGCG AATTTAAAAT GCRCTGGATA GATATGGAAG 1140

RACTGCTCTC ATACTTGCTG TATGTTGTGG ATCAGCAAGT ATAGTCAGCC YTCTACTTGA 1200

GCAAAATRTT GATGTATCTT CTCAAGATCT GGAAAGACGG CCAGAGAGTA TGCTGTTTCT 1260

AGTCATCATC ATGTAATTTG CCAGTTACTT TCTGACTACA AAGAAAAACA GATGTTAAAA 1320

ATCTCTTCTG AAAACAGCAA TCCAGAACAA GACTTAAAGC TGACATCAGA GGAAGAGTCA 1380

CAAAGGCTTA AAGGAAGTGA AAACAGCCAG CCAGAGGCAT GGAAACTTTT AAATTTAAAC 1440

TTTTGGTTTA ATGTTTTTTT TTTTTGCCTT AATAATATTA GATAGTCCCA AATGAAATWA 1500

CCTATGAGAC TAGGCTTTGA GAATCAATAG ATTCTTTTTT TAAGAATCTT TTGGCTAGGA 1560

GCGGTGTCTC ACGCCTGTAA TTCCAGCACC TTGAGAGGCT GAGGTGGGCA GATCACGAGA 1620

TCAGGAGATC GAGACCATCC TGGCTAACAC GGTGAAACCC CATCTCTACT AAAAATACAA 1680

AAACTTAGCT GGGTGTGGTG GCGGGTGCCT GTAGTCCCAG CTACTCAGGA RGCTGAGGCA 1740

GGAGAATGGC ATGAACCCGG GAGGTGGAGG TTGCAGTGAG CCGAGATCCG CCACTACACT 1800

CCAGCCTGGG TGACAGAGCA AGACTCTGTC TCAAAAAAAA AAAAAAAAAA AAA 1853

(2) INFORMATION FOR SEQ ID NO:296:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2184 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:296:

GGCACGAGAA TTAAAACCCT CAGCAAAACA GGCATAGAAG GGACATACCT TAAAGTAATA 60

AAAACCACCT ATGACAAGCC CACAGCCAAC ATAATACTAA ATGGGGAAAA GTTAGAAGCA 120

TTTCCTCTGA GAACTGCAAC AATAAATACA AGGATGCTGG ATTTTGTCAA ATGCCTTTTC 180

TGTGTCTGTT GAGATGCTTA TGTGACTTTG CTTTTAATTC TGTTTATGTG ATTATCACAT 240

TTATTGACTT GCCTGTGTTA GACCGGAAGA GCTGGGGTGT TTCTCAGGAG CCACCGTGTG 300

CTGCGGCAGC TTCGGGATAA CTTGAGGCTG CATCACTGGG GAAGT^CAC AYTCCTGTCC 360

GTGGCGCTGA TGGCTGAGGA CAGAGCTTCA GTGTGGCTTC TCTGCGACTG GCTTCTTCGG 420

GGAGTTCTTC CTTCATAGTT CATCCATATG GCTCCAGAGG AAAATTATAT TATTTTGTTA 480

TGGATGAAGA GTATTACGTT GTGCAGATAT ACTGCAGTGT CTTCATCTCT TGATGTGTGA 540

TTGGGTAGGT TCCACCATGT TGCCGCAGAT GACATGATTT CAGTACCTGT GTCTGGCTGA 600

AAAGTGTTTG TTTGTGAATG GATATTGTGG TTTCTGGATC TCATCCTCTG TGGGTGGACA 660

GCTTTCTCCA CCTTGCTGGA AGTGACCTGC TGTCCAGAAG TTTGATGGCT GAGGAGTATA 720

CCATCGTGCA TGCATCTTTC ATTTCCTGCA TTTCTTCCTC CCTGGATGGA CAGGGGGAGC 780

GGCAAGAGCA ACGTGGGCAC TTCTGGAGAC CACAACGACT CCTCTGTGAA GACGCTTGGG 840

AGCAAGAGGT GCAAGTGGTG CTGCCACTGC TTCCCCTGCT GCAGGGGAGC GGCAAGAGCA 900

ACGTGGTCGC TTGGGGAGAC TACGATGACA GCGCCTTCAT GGATCCCAGG TACCACGTCC 960

ATGGAGAAGA TCTGGACAAG CTCCACAGAG CTGCCTGGTG GGGTAAAGTC CCCAGAAAGG 1020

ATCTCATCGT CATGCTCAGG GACACGGATG TGAACAAGAG GGACAAGCAA AAGAGGACTG 1080

CTCTACATCT GGCCTCTGCC AATGGGAATT CAGAAGTAGT AAAACTCGTG CTGGACAGAC 1140

GATGTCAACT TAATGTCCTT GACAACAAAA AGAGGACAGC TCTGACAAAG GCCGTACAAT 1200

GCCAGGAAGA TGAATGTGCG TTAATGTTGC TGGAACATGG CACTGATCCA AATATTCCAG 1260

ATGAGTATGG AAATACCACT CTACACTATG CTGTCTACAA TGAAGATAAA TTAATGGCCA 1320

AAGCACTGCT CTTATACGGT GCTGATATCG AATCAAAAAA CAAGCATGGC CTCACACCAC 1380

TGCTACTTGG TATACATGAG CAAAAACAGC AAGTGGTGAA ATTTTTAATC AAGAAAAAAG 1440

CGAATTTAAA TGCGCTGGAT AGATATGGAA GAACTGCTCT CATACTTGCT GTATGTTGTG 1500

GATCAGCAAG TATAGTCAGC CCTCTACTTG AGCAAAATGT TGATGTATCT TCTCAAGATC 1560

TGGAAAGACG GCCAGAGAGT ATGCTGTTTC TAGTCATCAT CATGTAATTT GCCAGTTACT 1620

TTCTGACTAC AAAGAAAAAC AGATGTTAAA AATCTCTTCT GAAAACAGCA ATCCAGAACA 1680

AGACTTAAAG CTGACATCAG AGGAAGAGTC ACAAAGGCTT AAAGGAAGTG AAAACAGCCA 1740

GCCAGAGGCA TGGAAACTTT TAAATTTAAA CTTTTGGTTT AATGTTTTTT TTTTTTGCCT 1800

TAATAATATT AGATAGTCCC AAATGAAATW ACCTATGAGA CTAGGCTTTG AGAATCAATA 1860

GATTCTTTTT TTAAGAATCT TTTGGCTAGG AGCGGTGTCT CACGCCTGTA ATTCCAGCAC 1920

CTTGAGAGGC TGAGGTGGGC AGATCACGAG ATCAGGAGAT CGAGACCATC CTGGCTAACA 1980

CGGTGAAACC CCATCTCTAC TAAAAATACA AAAACTTAGC TGGGTGTGGT GGCGGGTGCC 2040

TGTAGTCCCA GCTACTCAGG ARGCTGAGGC AGGAGAATGG CATGAACCCG GGAGGTGGAG 2100

GTTGCAGTGA GCCGAGATCC GCCACTACAC TCCAGCCTGG GTGACAGAGC AAGACTCTGT 2160

CTCAAAAAAA AAAAAAAAAA AAAA 2184

(2) INFORMATION FOR SEQ ID NO:297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1855 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:297:



CCCCTGAGAT GTGCACGCCG 60
GTAACGGCTT GCACGCGCAP 120
GCAGGCGCAC GCCGCACGCG 180
GCTGCACGCG CGTTAACGGC 240
TGCTKGGCTK GGCGTTGKTY 300
TTGGATKGAC GTTTCYTYTY 360
TTGGCATTCC TTTGGGGTGG 420
GGGCGTGGGK CGCCCCCAGG 480
GTGGGGTGGG CTGTGCTGGG 540
AAACAGATTG GAAACCCGGA 600
TGCTGGTACT ACTGTTTCTC 660
GCCGGCTGCT TCTTCTGTGA 720
CGCCACTGCT TCCCCTGCTG 780
CACAACGACT CCTCTGTGAA 840
CTTCCCCTGC TGCAGGGGAG 900
AGCGCCTTCA TGGAKCCCAG 960
GCTGCCTGGT GGGGTAAAGT 1020
GTGAACAAGA RGGACAAGCA 1080
TCAGAAGTAG TAAAACTCGT 1140
AAGAGGACAG CTCTGACAAA 1200
CTGGAACATG GCACTGATCC 1260
GCTGTCTACA ATGAAGATAA 1320
GAATCAAAAA ACAAGGTATA 1380
TTTTAACATT GACGTGTGTA 1440
AATGAAAATA TTTTGAAATG 1500
CAAAGAAGCA TTAGAGGGTA 1560
GTTGAAAACA CTGAATTTGT 1620
GATTTTTTTC CCCTAATGAA 1680
AAAACTCCAA GAAAAGTTAA 1740
TCCTAAAAAA CAGTTU^TAGA 1800
TTCTGATCTC GTGCC 1855

CLAIMS 

1. An isolated DNA molecule, comprising:

(a) a nucleotide sequence selected from the group consisting of SEQ ID
NO:1, 3-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206,
207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269,
271-273, 275, 276, 278, 280, 281, 284, 288 and 291-297;

(b) a variant of said nucleotide sequence that contains one or more 
nucleotide substitutions, deletions, insertions and/or modifications at no more than 20% of 
the nucleotide positions, such that the antigenic and/or immunogenic properties of the 
polypeptide encoded by the nucleotide sequence are retained; or 

(c) a nucleotide sequence encoding an epitope of a polypeptide encoded by
at least one sequence selected from the group consisting of SEQ ID NO:1, 3-26, 28-77, 142,
143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218,
219, 221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278,
280, 281, 284, 288 and 291-297.

2. An isolated DNA molecule encoding an epitope of a polypeptide,
wherein said polypeptide is encoded by a nucleotide sequence that:

(a) hybridizes to a sequence selected from the group consisting of SEQ ID
NO:1, 3-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206,
207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269,
271-273, 275, 276, 278, 280, 281, 284, 288 and 291-297 under stringent conditions; and

(b) is at least 80% identical to a sequence selected from the group
consisting of SEQ ID NO:1, 3-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 178-192,
194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 250, 251, 253,
255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 and 291-297.
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3. An isolated DNA molecule encoding an epitope of a po^-peptide. 
wherein said polj^eptide is encoded by: 

(a) a nucleotide sequence transcribed from the sequence of SEQ ID NO: 

141; or 

(b) a variant of said nucleotide sequence that contains one or more 
nucleotide substitutions, deletions, insertions and/or modifications at no more than 20% of 
the nucleotide positions, such that the antigenic and/or immunogenic properties of the 
polypeptide encoded by the nucleotide sequence are retained. 
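
Claims 1(b) and 3(b) bound a variant by the fraction of nucleotide positions changed (no more than 20%), and claim 2(b) by percent identity (at least 80%). The claims do not state how positions are aligned or counted, so the following Python sketch is illustrative only. It assumes the two sequences have already been aligned to equal length, and the function names (`count_differences`, `percent_identity`, `within_variant_limit`) are hypothetical and do not appear in the specification.

```python
def count_differences(reference: str, candidate: str) -> int:
    """Count positions at which two pre-aligned sequences differ."""
    if len(reference) != len(candidate):
        raise ValueError("sequences must be aligned to the same length")
    # Assumption: a gap character ('-') opposite a nucleotide fails the
    # equality test, so it is counted as a changed position.
    return sum(1 for r, c in zip(reference.upper(), candidate.upper()) if r != c)


def percent_identity(reference: str, candidate: str) -> float:
    """Percentage of aligned positions that are identical."""
    if not reference:
        raise ValueError("sequences must be non-empty")
    same = len(reference) - count_differences(reference, candidate)
    return 100.0 * same / len(reference)


def within_variant_limit(reference: str, candidate: str,
                         max_percent_changed: int = 20) -> bool:
    """True if no more than max_percent_changed percent of positions differ."""
    if not reference:
        raise ValueError("sequences must be non-empty")
    # Integer arithmetic avoids floating-point edge cases at the boundary.
    changed = count_differences(reference, candidate)
    return changed * 100 <= max_percent_changed * len(reference)


if __name__ == "__main__":
    ref = "TTAGAGACCC"      # first ten nucleotides shown in Fig. 6
    variant = "TTAGAGACGG"  # two illustrative substitutions
    print(percent_identity(ref, variant))
    print(within_variant_limit(ref, variant))
```

Under this counting convention, a candidate that differs at exactly 20% of positions is exactly 80% identical, so the two thresholds coincide for equal-length, ungapped comparisons.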

4. An isolated DNA or RNA molecule comprising a nucleotide sequence 
complementary to a DNA molecule according to any one of claims 1-3. 
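
As an illustration of what a complementary sequence in claim 4 looks like, the following Python sketch derives the reverse complement of a DNA strand and the corresponding complementary RNA. It is a minimal sketch, not a method taken from the specification: the helper names (`reverse_complement`, `complementary_rna`) are hypothetical, and only the symbols A, C, G, T and N are handled.

```python
# Complement mapping for the four DNA bases plus N (undetermined base).
DNA_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(dna: str) -> str:
    """Return the complementary DNA strand, written 5' to 3'."""
    seq = dna.upper()
    unexpected = set(seq) - set("ACGTN")
    if unexpected:
        raise ValueError(f"unexpected symbols: {sorted(unexpected)}")
    # Complement every base, then reverse so the result reads 5' to 3'.
    return seq.translate(DNA_COMPLEMENT)[::-1]


def complementary_rna(dna: str) -> str:
    """Return the RNA strand complementary to the given DNA strand."""
    return reverse_complement(dna).replace("T", "U")


if __name__ == "__main__":
    fragment = "TTAGAGACC"  # first nine nucleotides shown in Fig. 6
    print(reverse_complement(fragment))
    print(complementary_rna(fragment))
```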

5. A recombinant expression vector comprising a DNA molecule 
according to any one of claims 1-3. 

6. A host cell transformed or transfected with an expression vector 
according to claim 5. 

7. A polypeptide comprising an amino acid sequence encoded by a DNA 
molecule according to any one of claims 1-3. 
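
The relationship in claim 7 between a DNA molecule and the amino acid sequence it encodes can be sketched in Python with the standard genetic code. This is an illustrative sketch under stated assumptions: the reading frame is taken to start at the first nucleotide, only unambiguous codons (no N) are supported, and the names `CODON_TABLE` and `translate` are hypothetical. The example input is the first row of Fig. 6, and the output uses one-letter amino acid codes rather than the three-letter codes of the figure.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, frame 1, into one-letter amino acids."""
    seq = "".join(dna.upper().split())  # tolerate spaces between codons
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    # Ambiguous symbols such as N raise KeyError; they are not handled here.
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


if __name__ == "__main__":
    row_1 = "TTA GAG ACC CAA TTG GGA CCT AAT TGG GAC CCA AAT TTC TCA AGT GGA"
    print(translate(row_1))  # LETQLGPNWDPNFSSG
```

The printed residues correspond to Leu Glu Thr Gln Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly, matching the translation given under the first row of Fig. 6.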

8. A polypeptide according to claim 7 wherein said polypeptide 
comprises an epitope of an amino acid sequence encoded by at least one nucleotide sequence 
selected from the group consisting of SEQ ID NO:1, 3-26, 28-77, 142, 143, 146-152, 154- 
166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243- 
245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 
and 291-297. 



9. A monoclonal antibody that binds to a polypeptide according to 
claim 7. 

10. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, at least one polypeptide according to claim 
7, and therefrom determining the presence of breast cancer in the patient. 

11. A method for determining the presence of breast cancer in a patient 
comprising detecting within a biological sample, at least one polypeptide encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86, 144, 145, 153, 
167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 
274, 277, 279, 282, 283, 285-287, 289, 290 and sequences that hybridize thereto under 
stringent conditions. 

12. The method of claims 10 or 11 wherein the biological sample is a 
portion of a breast tumor. 

13. The method of claim 10 wherein the step of detecting comprises 
contacting the biological sample with a monoclonal antibody according to claim 9. 

14. The method of claim 11 wherein the step of detecting comprises 
contacting the biological sample with a monoclonal antibody that binds to a polypeptide 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86, 
144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 
256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 and sequences that hybridize 
thereto under stringent conditions. 

15. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, an RNA molecule encoding at least one 
polypeptide according to claim 7, and therefrom determining the presence of breast cancer in 
the patient. 

16. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, at least one RNA molecule encoding a 
polypeptide encoded by a nucleotide sequence selected from the group consisting of SEQ ID 
NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 
249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 and sequences that 
hybridize thereto under stringent conditions; and therefrom determining the presence of breast 
cancer in the patient. 



17. The method of claims 15 or 16 wherein the biological sample is a 
portion of a breast tumor. 



18. The method of claim 15 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide according to claim 7, and therefrom determining the 
presence of breast cancer in the patient. 

19. The method of claim 16 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide encoded by a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 
241, 242, 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 and 
sequences that hybridize thereto under stringent conditions; and therefrom determining the 
presence of breast cancer in the patient. 

20. A method for monitoring the progression of breast cancer in a patient, 
comprising: 

(a) detecting an amount, in a biological sample, of at least one polypeptide 
according to claim 7 at a first point in time; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of polypeptide detected in steps (a) and (b), 
and therefrom monitoring the progression of breast cancer in the patient. 

21. A method for monitoring the progression of breast cancer in a patient, 
comprising: 

(a) detecting in a biological sample an amount of at least one polypeptide 
at a first point in time, the polypeptide being encoded by a nucleotide sequence selected from 
the group consisting of SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 
217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 
290 and sequences that hybridize thereto under stringent conditions; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of polypeptide detected in steps (a) and (b), 
and therefrom monitoring the progression of breast cancer in the patient. 

22. The method of claims 20 or 21 wherein the biological sample is a 
portion of a breast tumor. 

23. The method of claim 20 wherein the step of detecting comprises 
contacting a portion of the biological sample with a monoclonal antibody according to 
claim 9. 

24. The method of claim 21 wherein the step of detecting comprises 
contacting the biological sample with a monoclonal antibody that binds to a polypeptide 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86, 
144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 
256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 and sequences that hybridize 
thereto under stringent conditions. 

25. The method of claim 20 wherein said polypeptide comprises an epitope 
of an amino acid sequence encoded by at least one nucleotide sequence selected from the 
group consisting of SEQ ID NO:1, 3-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 178- 
192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 250, 251, 
253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 and 291-297. 

26. A method for monitoring the progression of breast cancer in a patient, 
comprising: 

(a) detecting an amount, within a biological sample, of at least one RNA 
molecule encoding a polypeptide according to claim 7 at a first point in time; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of RNA molecules detected in steps (a) and 
(b), and therefrom monitoring the progression of breast cancer in the patient. 

27. The method of claim 26 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide according to claim 7. 

28. A method for monitoring the progression of breast cancer in a patient, 
comprising: 

(a) detecting an amount, within a biological sample, of at least one RNA 
molecule at a first point in time, the RNA molecule encoding a polypeptide encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86, 144, 145, 153, 
167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 
274, 277, 279, 282, 283, 285-287, 289, 290 and sequences that hybridize thereto under 
stringent conditions; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of RNA molecules detected in steps (a) and 
(b), and therefrom monitoring the progression of breast cancer in the patient. 

29. A pharmaceutical composition, comprising a polypeptide according to 
claim 7 and a physiologically acceptable carrier. 

30. A pharmaceutical composition for inhibiting the development of breast 
cancer, comprising a polypeptide and a physiologically acceptable carrier, the polypeptide 
being encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 
78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 
252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 and sequences that hybridize 
thereto under stringent conditions. 

31. A vaccine, comprising a polypeptide according to claim 7 and an 
immune response enhancer. 

32. A vaccine, comprising a DNA molecule according to any one of 
claims 1-3. 

33. A vaccine, comprising a recombinant expression vector comprising a 
DNA molecule according to any one of claims 1-3. 

34. A vaccine for inhibiting the development of breast cancer, comprising 
a polypeptide and an immune response enhancer, the polypeptide being encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86, 144, 145, 153, 
167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 
274, 277, 279, 282, 283, 285-287, 289, 290 and sequences that hybridize thereto under 
stringent conditions. 

35. A pharmaceutical composition according to either of claims 29 or 30, 
for use in the manufacture of a medicament for inhibiting the development of breast cancer in 
a patient, comprising administering to a patient. 

36. A vaccine according to any one of claims 31-34, for use in the 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 

37. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies according to claim 9; and 

(b) a detection reagent. 

38. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies that bind to a polypeptide encoded 
by a nucleotide sequence selected from the group consisting of sequences provided in SEQ 
ID 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242 and 246, 248, 
249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290; and 

(b) a detection reagent. 

39. The kit of any one of claims 37 or 38 wherein the monoclonal 
antibody(s) are immobilized on a solid support. 

40. A diagnostic kit comprising two polymerase chain reaction primers, at 
least one of the primers being specific for an RNA molecule according to claim 4. 

41. The kit of claim 40, wherein at least one of the polymerase chain 
reaction primers comprises at least about 10 contiguous nucleotides of an RNA molecule 
according to claim 4. 

42. A diagnostic kit comprising two polymerase chain reaction primers, at 
least one of the primers being specific for an RNA molecule encoding a polypeptide encoded 
by a nucleotide sequence selected from the group consisting of SEQ ID NOS:78-86, 144, 
145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 
270, 274, 277, 279, 282, 283, 285-287, 289 and 290. 

43. The kit of claim 42, wherein at least one of the polymerase chain 
reaction primers comprises at least about 10 contiguous nucleotides of an RNA molecule 
encoding a polypeptide encoded by a nucleotide sequence selected from the group consisting 
of SEQ ID NOS:78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 
246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289 and 290. 

44. A diagnostic kit comprising at least one oligonucleotide probe, the 
oligonucleotide probe containing at least about 15 contiguous nucleotides of a DNA molecule 
according to claim 4. 



45. A diagnostic kit comprising at least one oligonucleotide probe, the 
oligonucleotide probe comprising at least about 15 contiguous nucleotides of a DNA 
sequence selected from the group consisting of SEQ ID NOS:78-86, 144, 145, 153, 167, 177, 
193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 
282, 283, 285-287, 289 and 290. 

46. A diagnostic kit comprising at least one oligonucleotide probe specific 
for a DNA molecule according to claim 4. 

47. The kit of claim 46, wherein the oligonucleotide probe comprises at 
least about 15 contiguous nucleotides of a DNA molecule according to claim 4. 
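
Claims 41, 43, 44, 45 and 47 define primers and probes by a minimum run of contiguous nucleotides shared with a target molecule (about 10 for primers, about 15 for probes). The Python sketch below shows one simple way to test that condition by exact string matching. It is illustrative only: the function name `shares_contiguous_run` is hypothetical, the word "about" in the claims is not modeled, and RNA input is handled by treating U as T.

```python
def shares_contiguous_run(oligo: str, target: str, min_length: int = 10) -> bool:
    """True if oligo and target share an exact run of min_length nucleotides."""
    if min_length <= 0:
        raise ValueError("min_length must be positive")
    # Normalize case and treat RNA (U) and DNA (T) alike for comparison.
    oligo = oligo.upper().replace("U", "T")
    target = target.upper().replace("U", "T")
    # Collect every window of min_length nucleotides found in the target.
    windows = {
        target[i:i + min_length]
        for i in range(len(target) - min_length + 1)
    }
    # The oligo qualifies if any of its own windows occurs in the target.
    return any(
        oligo[i:i + min_length] in windows
        for i in range(len(oligo) - min_length + 1)
    )


if __name__ == "__main__":
    # First 48 nucleotides shown in Fig. 6, used here as an example target.
    target = "TTAGAGACCCAATTGGGACCTAATTGGGACCCAAATTTCTCAAGTGGA"
    primer = "ACCCAATTGGGACC"  # 14 nucleotides taken from within the target
    print(shares_contiguous_run(primer, target, min_length=10))
    print(shares_contiguous_run(primer, target, min_length=15))
```

The second call returns False because a 14-nucleotide oligonucleotide cannot contain a 15-nucleotide run, which mirrors the different minimum lengths recited for primers and probes.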

48. A diagnostic kit comprising at least one oligonucleotide probe specific 
for a DNA sequence selected from the group consisting of SEQ ID NOS:78-86, 144, 145, 
153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 
274, 277, 279, 282, 283, 285-287, 289 and 290. 

49. The kit of claim 48, wherein the oligonucleotide probe comprises at 
least about 15 contiguous nucleotides of a DNA sequence selected from the group consisting 
of SEQ ID NOS:78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 
246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289 and 290. 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B18Ag1 



TTA GAG ACC CAA TTG GGA CCT AAT TGG GAC CCA AAT TTC TCA AGT GGA 48
Leu Glu Thr Gln Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly
1               5                   10                  15

??? ??? ??? ??? ??? ??? ??? ??? ??? TAT CTC CTC GTG GGT ATT CAG 96
Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly Ile Gln
            20                  25                  30

GGA GCT GCC CAG AAA CCT ATA AAC TTG TCT AAG GCG ATT GAA GTC GTC 144
Gly Ala Ala Gln Lys Pro Ile Asn Leu Ser Lys Ala Ile Glu Val Val
        35                  40                  45



CAG GGG CAT GAT GAG TCA CCA GGA GTG TTT TTA GAG CAC CTC CAG GAG 192
Gln Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gln Glu
    50                  55                  60

GCT TAT CGG ATT TAC ACC CCT TTT GAC CTG GCA GCC CCC GAA AAT AGC 240
Ala Tyr Arg Ile Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser
65                  70                  75                  80

CAT GCT CTT AAT TTG GCA TTT GTG GCT CAG GCA GCC CCA GAT AGT AAA 288
His Ala Leu Asn Leu Ala Phe Val Ala Gln Ala Ala Pro Asp Ser Lys
                85                  90                  95

AGG AAA CTC CAA AAA CTA GAG GGA TTT TGC TGG AAT GAA TAC CAG TCA 336
Arg Lys Leu Gln Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gln Ser
            100                 105                 110

GCT TTT AGA GAT AGC CTA AAA GGT TTT 363
Ala Phe Arg Asp Ser Leu Lys Gly Phe
        115                 120



Fig. 6 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B17Ag1 



GC TGGGCACAGT GGCTCATACC TGTAATCCTG ACCGTTTCAG AGGCTCAGGT 60 

CG CTTGAGCCCA AGATTTCAAG ACTAGTCTGG GTAACATAGT GAGACCCTAT 120 

AA AAATAAAAAA ATGAGCCTGG TGTAGTGGCA CACACCAGCT GAGGAGGGAG 180 

CT AGGAGA 196 



Fig. 7 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B17Ag2 



GC TTGGGGGCTC TGACTAGAAA TTCAAGGAAC CTGGGATTCA AGTCCAACTG 60 

AC TTACACTGTG GNCTCCAATA AACTGCTTCT TTCCTATTCC CTCTCTATTA 120 

AA GGAAAACGAT GTCTGTGTAT AGCCAAGTCA GNTATCCTAA AAGGAGATAC 180 

AT TAAATATCAG AATGTAAAAC CTGGGAACCA GGTTCCCAGC CTGGGATTAA 240 

CA AGAAGACTGA ACAGTACTAC TGTGAAAAGC CCGAAGNGGC AATATGTTCA 300 

TT GAAGGATGGC TGGGAGAATG AATGCTCTGT CCCCCAGTCC CAAGCTCACT 360 

CT CCTTTATAGC CTAGGAGA 388 



Fig. 8 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag2a 



GC CTATAATCAT GTTTCTCATT ATTTTCACAT TTTATTAACC AATTTCTGTT 60 

AA AATATGAGGG AAATATATGA AACAGGGAGG CAATGTTCAG ATAATTGATC 120 

TG ATTTCTACAT CAGATGCTCT TTCCTTTCCT GTTTATTTCC TTTTTATTTC 180 

GG TCGAATGTAA TAGCTTTGTT TCAAGAGAGA GTTTTGGCAG TTTCTGTAGC 240 

CT GCTCATGTCT CCAGGCATCT ATTTGCACTT TAGGAGGTGT CGTGGGAGAC 300 

CI ATTTTTTCCA TATTTGGGCA ACTACTA 337 



Fig. 9 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag1b 



GC CATACAGTGC CTTTCCATTT ATTTAACCCC CACCTGAACG GCATAAACTG 60 

GC TGGTGTTTTT TACTGTAAAC AATAAGGAGA CTTTGCTCTT CATTTAAACC 120 

AT TTCATATTTT ACGCTCGAGG GTTTTTACCG GTTCCTTTTT ACACTCCTTA 180 

TT TAAGTCGTTT GGAACAAGAT ATTTTTTCTT TCCTnnrAnr TTTT A AP A TT 240 

TT TGTGTCTGGG GGACTnnnn TCACTGTTTC TCAGAGTTGC AAATCAAGGC 300 

CC AAGAAAAAAA AATTTTTTTG TTTTATTTGA AACTGGACCG GATAAACGGT 360 

CG GCTGCTGTAT ATAGTTTTAA ATGGTTTATT GCACCTCCTT AAGTTGCACT 420 

GG GGGGNTTTTG NATAGAAAGT NTTTANTCAC ANAGTCACAG GGACTTTTNT 480 

NA CTGAGCTAAA AAGGGCTGNT TTTCGGGTGG GGGCAGATGA AGGCTCACAG 540 

TC TCTTAGAGGG GGGAACTNCT A 571 



Fig. 10 



NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag1a 

TA ATAACTTAAA TATATTTTGA TCACCCACTG GGGTGATAAG ACAATAGATA 60 

TT TCCAAAAAGC ATAAAACCAA AGTATCATAC CAAACCAAAT TCATACTGCT 120 

CC GCACTGAAAC TTCACCTTCT AACTGTCTAC CTAACCAAAT TCTACCCTTC 180 

GG TGCGTGCTCA CTACTCTTTT TTTTTTTTTT TTTNTTTTGG AGATGGAGTC 240 

CA GCCCAGGGGT GGAGTACAAT GGCACAACCT CAGCTCACTG NAACCTCCGC 300 

TT CATGAGATTC TCCTGNTTCA GCCTTCCCAG TAGCTGGGAC TACAGGTGTG 360 

TG CCTGGNTAAT CTTTTTTNGT TTTNGGGTAG AGATGGGGGT TTTACATGTT 420 

TG GTNTCGAACT CCTGACCTCA AGTGATCCAC CCACCTCAGG CTCCCAAAGT 480 

TA CAGACATGAG CCACTGNGCC CAGNCCTGGT GCATGCTCAC TTCTCTAGGC 540 

548 



Fig. 11 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B11Ag1 



TG CACATGCAGA ATATTCTATC GGTACTTCAG PTATTArTrA TTTTnAinnr 60 

AG CCTATCCTCA AGATGAGTAT TTAGAAAGAA TTHATTTAnr nATArArPAA 120 

GC ACTCTGACTA CACGAAATTG TTCAGATnTn ATHHATTTAT nAPArxTrAT 180 

GA GATTATTAAG TGATTATTTT AAAGGGAATP PATTAATTPP ATAATAtPTT 240 

TC AAGATGATAT AGAAATAGAA CAGAAAGAGA PTAPAAATHA AnATPTATPA 300 

TA TTGAAGAGCC TATAGTAGAA AATGAATTAG CTGCATTTAT TAGCCTTACA 360 

IT TTCCTGATGA ATCTTATATT CAGCCATCGA CATAGCATTA CCTGATGGGC 420 

GA ATAATAGAAA CTGGGTGCGG GGCTATTGAT GAATTCATCC NCAGTAAATT 480 

AC AAAATATAAC TCGATTGCAT TTGGATGATG GAATACTAAA TCTGGCAAAA 540 

GG AGCTACTAGT AACCTCTCTT TTTGAGATGC AAAATTTTCT TTTAGGGTTT 600 

CT ACTTTACGGA TATTGGAGCA TAACGGGA 638 



Fig. 12 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA3c 



ACTGATGGAT GTCGCCGGAG GCGAGGGGCC TTATCTGATG CTCGGCTGCC TGTTCGTGAT 60 

GTGCGCGGCG ATTGGGCTGT TTATCTCAAA CACCGCCACG GCGGTGCTGA TGGCGCCTAT 120 

TGCCTTAGCG GCGGCGAAGT CAATGGGCGT CTCACCCTAT CCTTTTGCCA TGGTGGTGGC 180 

GATGGCGGCT TCGGCGGCGT TTATGACCCC GGTCTCCTCG CCGGTTAACA CCCTGGTGCT 240 

TGGCCCTGGC AAGTACTCAT TTAGCGATTT TGTCAAAATA GGCGTG 286 



Fig. 13 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B9CG1 

AG CAGCCCCTTC TTCTCAATTT CATCTGTCAC TACCCTGGTG TAGTATCTCA 60 

CA TTTTTATAGC CTCCTCCCTG GTCTGTCTTT TGATTTTCCT GCCTGTAATC 120 

AC ATAACTGCAA GTAAAGATTT CTAAAGTGTG GTTATGCTCA TGTCACTCCT 180 

AA ATAGTTTCCA TTACCGTCTT AATAAAATTC GGATTTGTTC TTTNCTATTN 240 

CA CCTATGACCG AA 262 



Fig. 14 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B9CG3 

AG CAAAGCCAGT GGTTTGAGCT CTCTACTGTG TAAACTCCTA AACCAAGGCC 60 

TA AATGGTGGCA GGATTTTTAT TATAAACATG TACCCATGCA AATTTCCTAT 120 

GA TATATTCTTC TACATTTAAA CAATAAAAAT AATCTATTTT TAAAAGCCTA 180 

AG TTAGGTAAGA GTGTTTAATG AGAGGGTATA AGGTATAAAT CACCAGTCAA 240 

TG CCTATGACCG A 261 



Fig. 15 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B2CA2 



GG GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 60 

AT AGGAAAATTC CCAAAGAGGG AATGTCCTGT TGCTCGCCAG TTTTTNTGTT 120 

GG ANAAGGCAAN GAGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 180 

CG NCTTGCNANG ATCTTCAT 208 



Fig. 16 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA1 



GG GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 60 

AT AGGAAAATTC CCAAAGAGGG AATGTCCTGT TGCTCGCCAG TTTTTNTGTT 120 

GG ANAAGGCAAN GAGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 180 

CG NCTTGCNANG ATCTTCAT 208 



Fig. 17 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA2 

GG GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 60 

AT AGGAAAATTC CCAAAGAGGG AATGTCCTGT TGCTCGCCAG TTTTTNTGTT 120 

GG ANAAGGCAAN GAGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 180 

CG NCTTGCNANG ATCTTCAT 208 



Fig. 18 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA3 

AG GGAGCAAGGA GAAGGCATGG AGAGGCTCAN GCTGGTCCTG GCCTACGACT 60 

CT GTCGCCGGGG ATGGTGGAGA ACTGAAGCGG GACCTCCTCG AGGTCCTCCG 120 

TO NCCGTCCAGG AGGAGGGTCT TTCCGTGGTC TNGGAGGAGC GGGGGGAGAA 180 

TC ATGGTCNACA TCCC 204 



Fig. 19 

NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B4CA1 

TC AGGAGCGGGT AGAGTGGCAC CATTGAGGGG ATATTCAAAA ATATTATTTT 60 

TG ATAGTTGCTG AGTTTTTCTT TGACCCATGA GTTATATTGG AGTTTATTTT 120 

CC AATCGCATGG ACATGTTAGA CTTATTTTCT GTTAATGATT NCTATTTTTA 180 

GA TTTGAGAAAT TGGTTNTTAT TATATCAATT TTTGGTATTT GTTGAGTTTG 240 

GC TTAGTATGTG ACCA 264 



Fig. 20 

This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1,2,4-10,12,13,15,17,18,20,22,23,25-27,29,31-33, 
35-37,39-41,44,46,47 (all partially) 



Invention 1: 

Isolated DNA molecule comprising a nucleotide sequence with 
Seq.ID 1, the corresponding polypeptide, an isolated DNA or 
RNA molecule comprising a nucleotide sequence complementary 
to Seq. ID 1, a recombinant expression vector comprising 
that molecule, a host cell transfected with that vector, a 
monoclonal antibody binding to the polypeptide encoded by 
Seq. ID 1, as well as pharmaceutical compositions, vaccines, 
diagnostic methods used in the diagnosis and therapy of 
breast cancer. 



2. Claims: 1,2,4-10,12,13,15,17,18,20,22,23,25-27,29,31-33, 
35-37,39-41,44,46,47 (all partially) 



Invention 2-75: 

Isolated DNA molecule comprising a nucleotide sequence with 
Seq.ID 3, the corresponding polypeptide, an isolated DNA or 
RNA molecule comprising a nucleotide sequence complementary 
to Seq. ID 3, a recombinant expression vector comprising 
that molecule, a host cell transfected with that vector, a 
monoclonal antibody binding to the polypeptide encoded by 
Seq. ID 3, as well as pharmaceutical compositions, vaccines, 
diagnostic methods used in the diagnosis and therapy of 
breast cancer. 

...ibidem for each of sequences 4-26, 28-77 



3. Claims: 3 (completely), 4-10,12,13,15,17,18,20,22,23,25-27, 
29,31-33,35-37,39-41,44,46,47 (all partially) 



Invention 76: 

Isolated DNA molecule comprising a nucleotide sequence with 
Seq.ID 141, the corresponding polypeptide, an isolated DNA 
or RNA molecule comprising a nucleotide sequence 
complementary to Seq. ID 141, a recombinant expression 
vector comprising that molecule, a host cell transfected 
with that vector, a monoclonal antibody binding to the 
polypeptide encoded by Seq. ID 141, as well as 
pharmaceutical compositions, vaccines, diagnostic methods 
used in the diagnosis and therapy of breast cancer. 



4. Claims: 1,2,4-10,12,13,15,17,18,20,22,23,25-27,29,31-33, 
35-37,39-41,44,46,47 (all partially) 

Invention 77-200: 

Isolated DNA molecule comprising a nucleotide sequence with 
Seq.ID 142, the corresponding polypeptide, an isolated DNA 
or RNA molecule comprising a nucleotide sequence 
complementary to Seq. ID 142, a recombinant expression 
vector comprising that molecule, a host cell transfected 
with that vector, a monoclonal antibody binding to the 
polypeptide encoded by Seq. ID 142, as well as 
pharmaceutical compositions, vaccines, diagnostic methods 
used in the diagnosis and therapy of breast cancer. 

...ibidem for each of sequences 

143,146-152,154-166,168-176,178-192,194-198,200-204,206,207, 
209-214,216,218,219,221-240,243-245,247,250,251,253,255, 
257-266,268,269,271-273,275,276,278,280,281,284,288 and 291-297 



5. Claims: 11,12,14,16,17,19,21,22,24,30,34,35,36,38,39,42,43, 
45,48,49 (all partially) 



Invention 201-241: 

Isolated DNA molecule comprising a nucleotide sequence with 
Seq.ID 78, the corresponding polypeptide, an isolated DNA or 
RNA molecule comprising a nucleotide sequence complementary 
to Seq. ID 78, a recombinant expression vector comprising 
that molecule, a host cell transfected with that vector, a 
monoclonal antibody binding to the polypeptide encoded by 
Seq. ID 78, as well as pharmaceutical compositions, 
vaccines, diagnostic methods used in the diagnosis and 
therapy of breast cancer. 

...ibidem for each of sequences 

78-86,144,145,153,167,177,193,199,205,208,215,217,220,241,242, 
246,248,249,252,256,267,270,274,277,279,282,283,285-287,289, 